R. JIM TWA


and 


MYRNA GREENE 
The University of Lethbridge 


Prediction of Success in Student Teaching 
as a Criterion for Selection in 
Teacher Education Programs 


If a faculty of education is to select the best candidates for its teacher education pro- 
gram, it is necessary to be able to predict performance in student teaching. To 
determine their usefulness in predicting student teaching performance, five 
biographical particulars and forty personality test scores, obtained prior to 
candidates’ entry into the education program, were examined in this study. Although 
the nine variables which entered the prediction equation for the total group 
accounted for only 37 percent of the variance in student teaching grade, much more 
promising results were obtained when the group was subdivided by sex and subject
specialization. The results indicated that the use of any single selection criterion,
such as GPA, should be discontinued in favour of multiple selection criteria combined 
mechanically to produce a prediction of the candidates’ grades in student teaching. 


Every profession attempts to select for its training program those candidates 
who will become the best professionals in their particular field. Historically, 
teacher education institutions have had the reputation of being little concerned 
about who should enter the teaching profession. There is no intention to imply that
faculties of education do not have entrance requirements. Most of the institutions
do have selection policies, but very few limit enrollments. Haberman (1972) in his 
survey found that most of the 386 NCATE (National Council for Accreditation of 
Teacher Education) member institutions in the United States had some sort of 


Mrs. Greene has been employed for the past three years in the Faculty of Education Research Centre at the University of 
Lethbridge. She received a Bachelor of Education Degree from the University of Lethbridge in 1976. Prior to that, she spent 
several years as an instructor in medical-surgical nursing. 


Jim Twa is a Professor of Education at the University of Lethbridge where he has instructed in clinical supervision, 
analysis of teaching and practicum aspects of the teacher education program for the past nine years. He is presently 
coordinator of the University of Lethbridge’s early field experience practicum which serves as one of the screening processes 
for students entering the Faculty of Education. 




entrance requirements. Three hundred and forty-four of the institutions which 
constituted his sample relied heavily upon a minimum cumulative grade point average
(GPA) and admitted almost everyone who had attained the minimum. If the
alleged inflation of college grades (“Too Many A’s,” 1974) actually has occurred 
over the past few years, it follows that there are increased numbers of eligible, and 
generally lower quality, candidates being educated in ever-expanding programs. 
This condition, not conducive to producing excellent graduates, was discussed by 
Wisniewski at the 1978 Association of Teacher Educators’ Annual Conference 
(Wisniewski, 1978). 

In a recent comprehensive review of what the author (Menges, 1975) terms 
“assessing readiness for professional practice,” several generalizations are made: 


First, definitions of effective practice should emphasize many discrete behaviors and 
characteristics rather than global definitions. Second, measures of these characteristics 
(predictors) should be as similar to the criterion itself as possible. Third, multiple 
assessment devices should be used so that no single type is overemphasized. Measures of 
subject matter knowledge, in particular, should be less heavily weighed. Fourth, data 
should not be used for decision-making until longitudinal studies demonstrate adequate 
predictive validity. (p. 201) 


Teacher educators are not ignorant of the problems associated with present 
selection policies, and there appears to be increasing dissatisfaction with the 
situation. Two of the major speakers, Bruce Joyce and Karl Openshaw, at the 1978 
Association of Teacher Educators’ Annual Conference asserted that for significant 
improvement to occur in the education of teachers there must be a move toward 
improved selection procedures. In the face of the surplus of teachers, limited 
budgets, and the prospect of the necessity to limit enrollments, it would seem 
obvious that the institutions responsible for the education of teachers must 
increase their effort to develop selection criteria which are related to success in the 
field. The ability of a faculty to predict the success of its candidates from data 
available prior to their entry into the faculty would be a tremendous asset. The 
institutions’ resources could then be invested in the most promising candidates, 
and students who were identified as having low potential for success in teaching 
would not waste their time and money pursuing an ill-chosen career. 


With the present trend toward increased emphasis on the field experiences 
portion of teacher education programs, performance in student teaching is likely to 
become, if it is not already, the most important criterion in certification decisions. 
Faced with the oversupply of teachers and the consequent prospect of limited 
enrollments, the question of which students to admit becomes increasingly 
perplexing to schools of education. Prediction of performance in student teaching 
has been investigated by a considerable number of researchers employing a wide 
range of predictor variables. However, many of the studies dealt with only one 
category of predictor variable, such as grades, or personality factors, and many 
were one-shot research projects which were not replicated, thereby violating most 
of Menges’ generalizations. 

College grades, although the most widely used criterion for selection in teacher
education programs, have received surprisingly little research attention as a
predictor of student teaching performance. Those studies that have examined this
predictor in various forms have generally reported a significant positive correlation
between it and performance in student teaching (Greene, 1977; Emanuel, Laramore
& Sagan, 1975; Williams & Fox, 1967), but when used by itself, it has not proven to
be particularly effective.


Another widely used criterion for selection which has received very limited 
attention is English proficiency. Bedford (1971) found that a measure of written 
English had some predictive value when it was combined with other variables. Two 
others (Cortis, 1968; Warburton, Butcher & Forrest, 1963) using a somewhat 
related predictor, the vocabulary portion of intelligence tests, found it of no value. 
Intelligence tests in general were found to be poor predictors in these same two 
studies. 

Burroughs (1958), after making a rather detailed study of the interview, 
concluded that it was as good as any other predictor, a finding which was 
duplicated by Bedford (1971). Attitudes toward education, as measured by the 
Minnesota Teacher Attitude Inventory (Cook, Leeds & Callis, 1951), have been 
examined for their predictive value in a number of studies (Tarpey, 1965; 
Michaelis, 1954; Bedford, 1971; Williams & Fox, 1967). The first three studies 
reported the MTAI to have some predictive value while the fourth found it of no 
value. Tarpey and Bedford also found the Kuder Preference Record (Form C, 
1960) to have very little predictive power. 


In the area of personality and psychological measures, the Sixteen Personality
Factor Questionnaire (Cattell, Eber & Tatsuoka, 1970) has been researched extensively (Davis &
Satterly, 1969; Henjam, 1969; McClain, 1968; Morris, 1975; Tarpey, 1965; 
Warburton et al., 1963) and although the findings are not in complete agreement, 
all of the studies reported that it was one of the best predictors of success in 
student teaching. Michaelis (1954), using the Minnesota Multiphasic Personality 
Inventory, and Tyler (1954) using the MMPI and the Heston Personal Adjustment 
Inventory found them to be of little value as predictors. However, Gough, 
Durflinger and Hill (1968) reported considerable success in developing a prediction 
equation based on the scales of the California Psychological Inventory (Gough, 
1957). Two other researchers (Veldman & Kelley, 1965) also using the CPI, 
identified significant test differences between student teachers who were rated 
high and those rated low in student teaching performance. 


There appear to be conflicting results from the research in this area. However, 
by comparing some of the earlier studies (Sandiford, Cameron, Conway & Long, 
1937; Michaelis, 1954; Tyler, 1954) to some of the more recent (Gough et al., 1968; 
McClain, 1968; Morris, 1975), it is possible to discern the progress which has been 
made. Later studies attempted to predict performance in student teaching based 
on different personality configurations and characteristics for male and female 
subjects teaching in different situations, for example, elementary versus secondary 
level. The results indicate that these types of studies are producing significant 
findings and should be continued. 


The present study was based on two critical assumptions: (1) that it is not nec- 
essary to solve completely the “teacher effectiveness” problem before significant 
research can be done on the selection of candidates, and (2) that performance in 
student teaching is, or should be, the most important criterion upon which 
certification decisions are based. 

Faced with the realities of selecting and counseling teacher education 
candidates, the investigator conducted a study to determine the feasibility of 
predicting performance in student teaching, based on data collected prior to the 
students’ admission to the teacher education program. 
Method


The study is a part of an ongoing, longitudinal project of the type recommended 
by Menges, currently underway at the University of Lethbridge (Dravland & 
Greene, 1978). That larger study provided for the creation of a data bank which 
contains up to 800 variables on each of about 700 education students in three 
populations, created for purposes of replication and validation. The data bank 
contains information on the students’ biographical particulars, their two years in 
arts and science courses (students at the University of Lethbridge are required to 
spend two years in the Faculty of Arts and Science before being admitted to the 
Faculty of Education), scores on various English and psychological tests, 
performance in courses and practica in education, and data on their performance as 
teaching graduates. The data for this prediction study were drawn from the second 
population of that data bank. Data from a third population will be used for the 
purpose of validating the prediction equations developed in this study. 


The Sample 


The subjects for this study were the 180 students who were enrolled in the first 
course in the teacher education program at the University of Lethbridge, 
Lethbridge, Alberta, Canada, in the four semesters beginning Spring Semester, 
1974 through to the end of Summer Session 1, 1975. During the first week of the 
course, the tests were administered to those students in three two-hour testing 
sessions. The biographical particulars were extracted from students’ records. As
these students progressed through the regular teacher education program, their 
performance in student teaching was evaluated by the university supervisors who 
observed the classroom performance of the students for a minimum of two hours
per week during their six-week full-time student teaching experience. By the time 
the data had been assembled, the number of students for whom there were no 
missing data had decreased to 141 — 74 females and 67 males. Of the 141 subjects, 
69 were classified as elementary and 70 as secondary, with two students 
unclassified as to route of preparation. This resulted in a further decrease to 139 
subjects for any analysis involving these two subgroups. 


The Variables 


The variables chosen for this study included five biographical particulars, 40 
test scores, and one measure of student teaching effectiveness: 
1. Biographical Particulars:
   Sex
   Route: Elementary vs. Secondary Preparation
   Age
   Admission GPA: cumulative GPA at time of entry into the first course in Education
   Major GPA: the GPA in the teaching major at the time of entry into the Faculty of Education
2. Test Scores:
   MTAI: percentile rank
   D-Scale: total score on a Dogmatism scale
   Q-Sort: score on an adjective Q-sort designed to measure self-concept
   16PF: raw scores on each sub-factor
      A  16PF: Cyclothymia vs. Schizothymia Factor
      B  16PF: Intelligence Factor
      C  16PF: Ego Strength Factor
      E  16PF: Dominance Factor
      F  16PF: Surgency Factor
      G  16PF: Conscientiousness Factor
      H  16PF: Extraversion Factor
      I  16PF: Sensitivity Factor
      L  16PF: Paranoid Factor
      M  16PF: Unconventionality Factor
      N  16PF: Sophistication Factor
      O  16PF: Guilt Proneness Factor
      Q1 16PF: Radicalism Factor
      Q2 16PF: Self-sufficiency Factor
      Q3 16PF: Self-control Factor
      Q4 16PF: Id Pressure (Tenseness) Factor

   TAT: achievement motivation score from the Thematic Apperception Test
   College English Placement Test: percentile rank
   Convergent Question: score on a Convergent Question Test
   CPI: standard score on 18 sub-scales
      Do CPI: Domination Scale
      Cs CPI: Capacity for Status Scale
      Sy CPI: Sociability Scale
      Sp CPI: Social Presence Scale
      Sa CPI: Self-Acceptance Scale
      Wb CPI: Sense of Well-Being Scale
      Re CPI: Responsibility Scale
      So CPI: Socialization Scale
      Sc CPI: Self-Control Scale
      To CPI: Tolerance Scale
      Gi CPI: Good Impression Scale
      Cm CPI: Communality Scale
      Ac CPI: Achievement via Conformance Scale
      Ai CPI: Achievement via Independence Scale
      Ie CPI: Intellectual Efficiency Scale
      Py CPI: Psychological Mindedness Scale
      Fx CPI: Flexibility Scale
      Fe CPI: Femininity Scale

3. Grade in Student Teaching¹: final grade in a six-week student teaching practicum,
   coded as follows:

   Numerical Value    Letter Grade
   1                  F, I, W (Failing, Incomplete, Withdrew)
   3                  C (Average)
   4                  B (Superior)
   5                  A (Excellent)


Limitations 


A stepwise multiple regression procedure was used to “isolate a subset of avail- 
able predictor variables that will yield an optimal prediction equation with as few 
terms as possible” (Nie, Hull, Jenkins, Steinbrenner & Bent, 1975, p. 345). In the 
initial analysis emphasis was on producing the most effective “mechanical” model 
for the selection of teacher candidates. There are obvious ethical and measurement 
problems associated with selection schemes and quotas. There was no attempt in 
this study to deal with these issues. Nor was there any attempt to analyze the 
psychological characteristics implicit in the regression equations. The practicality 
of applying the model will be dealt with in a further study. 
For a variable to be included in the regression analysis, the F value for the
difference between multiple correlations had to be significant at the .05 level. The
degrees of freedom used to test the significance were (m₁ - m₂) and (N - m₁ - 1),
where m₁ is the larger number of independent variables and m₂ the smaller number
of independent variables (Guilford & Fruchter, 1973, p. 368).
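
The entry test can be written out explicitly. The sketch below assumes the usual F ratio for the difference between two multiple correlations, which the cited source supplies but this report does not print; the function name is illustrative. The example uses the multiple correlations reported in Table 2 for Steps 3 and 4 (.432 and .477) with N = 141.

```python
def f_for_r_increase(r_larger, r_smaller, n, m1, m2):
    """F ratio for the gain in multiple R when predictors are added.

    r_larger  -- multiple R for the equation with m1 predictors
    r_smaller -- multiple R for the equation with m2 predictors (m2 < m1)
    n         -- number of subjects
    Returns (F, numerator df, denominator df).
    """
    if not m1 > m2 >= 0:
        raise ValueError("m1 must exceed m2, and m2 must be non-negative")
    df_num = m1 - m2
    df_den = n - m1 - 1
    if df_den <= 0:
        raise ValueError("too few subjects for this many predictors")
    gain_per_df = (r_larger ** 2 - r_smaller ** 2) / df_num
    error_per_df = (1 - r_larger ** 2) / df_den
    return gain_per_df / error_per_df, df_num, df_den


# Step 3 -> Step 4 of the total-group analysis, values from Table 2.
f_ratio, df_num, df_den = f_for_r_increase(0.477, 0.432, n=141, m1=4, m2=3)
print(f"F({df_num}, {df_den}) = {f_ratio:.2f}")
```

Under these assumptions the fourth predictor yields an F of about 7.2 on 1 and 136 degrees of freedom, which exceeds the 3.92 cutoff reported for the total group.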


Also included in the discussion are those variables which are correlated with the 
criterion at the .05 level of significance. The sample was examined first as a total 
group and was then subdivided for further analyses. 


Results 
The Total Group 
The means and standard deviations of the criterion and predictor variables for 
the total group are presented in Table 1. Space considerations do not permit 
inclusion of the complete correlation matrix; however, it is available to any 
researcher wishing to reproduce this work. 


Of the total 45 predictor variables available, only 12 qualified for consideration 
(See Table 2). With the F value for inclusion set at 3.92, the regression equation 
produced a multiple correlation of .609 with a standard error of .82254. The 
predicted grade in student teaching was as follows: 

Grade = 1.64481 + (-.00255)Fx CPI + (.16554)Q, 16PF + (.54311)Major GPA
      + (.10243)F 16PF + (.02184)Do CPI + (-.03121)Ai CPI
      + (.02964)Cm CPI + (-.11572)O 16PF + (-.02129)Sy CPI          (1)
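
To show what combining the predictors "mechanically" amounts to, the following sketch applies Equation 1 to one candidate. The coefficients are those printed above; the candidate's scores are invented for illustration, the function and variable names are hypothetical, and the subscript of the 16PF Q factor is not legible in the source, so its key is left generic.

```python
# Coefficients of Equation 1 as printed. The subscript on the 16PF "Q"
# factor cannot be read in the source, so the key is deliberately generic.
EQ1_INTERCEPT = 1.64481
EQ1_WEIGHTS = {
    "Fx CPI": -0.00255,
    "Q 16PF": 0.16554,
    "Major GPA": 0.54311,
    "F 16PF": 0.10243,
    "Do CPI": 0.02184,
    "Ai CPI": -0.03121,
    "Cm CPI": 0.02964,
    "O 16PF": -0.11572,
    "Sy CPI": -0.02129,
}


def predict_grade(scores, intercept=EQ1_INTERCEPT, weights=EQ1_WEIGHTS):
    """Return the predicted student teaching grade (1 to 5 scale)."""
    missing = set(weights) - set(scores)
    if missing:
        raise KeyError(f"missing predictor scores: {sorted(missing)}")
    return intercept + sum(w * scores[name] for name, w in weights.items())


# Hypothetical candidate; none of these scores come from the study.
candidate = {
    "Fx CPI": 55, "Q 16PF": 6, "Major GPA": 3.0, "F 16PF": 5, "Do CPI": 54,
    "Ai CPI": 56, "Cm CPI": 50, "O 16PF": 5, "Sy CPI": 52,
}
predicted = predict_grade(candidate)
print(f"Predicted grade: {predicted:.2f}")
```

For this invented candidate the predicted grade is about 3.87. With the reported standard error of .82254, such a prediction is uncertain by most of a letter grade, which is worth keeping in mind before any cutoff is applied.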

Examining the makeup of the regression equation and the order in which the
predictor variables entered the equation, one might wonder why a variable such as
the Q-Sort, which was correlated at the .01 level of significance with Grade in
Student Teaching, did not enter the equation, while Sy CPI, with a correlation of
.092, did enter. There are two conditions under which variables may prove to be
valuable as predictors in the multiple regression process. The first occurs when two
predictor variables have high correlations with the criterion and low correlations
with one another. The other situation in which a variable may prove to be valuable
in the equation occurs when it acts as a suppressor on one of the other predictor
variables which has previously entered the equation. In this case the suppressor
variable has a low correlation with the criterion and a high correlation with one of
the other predictor variables.² Examples of these situations are shown in Table 3,
which identifies the intercorrelations of the variables in question. For example, the
Q-Sort was significantly correlated (r = -.206) with Fx CPI, which entered the
equation on Step 1. Apparently Q-Sort was accounting for much of the same
variance in the criterion as was Fx CPI and therefore had its predictive value
lowered when Fx CPI entered the equation. On the other hand, Sy CPI, which
produced a low correlation with Grade in Student Teaching (r = .092), did enter the
equation. Sy CPI was correlated with one of the variables which had previously
entered the equation, Cm CPI (r = .118), and therefore appeared to be acting as a
suppressor on that variable. In this way it became a useful predictor in the
equation while the Q-Sort did not.
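
A two-predictor case makes these conditions concrete. The sketch below uses the standard formula for the squared multiple correlation computed from three pairwise correlations; the correlation values are invented for illustration and are not taken from Table 3.

```python
def r_squared_two_predictors(r_y1, r_y2, r_12):
    """Squared multiple correlation of a criterion with two predictors.

    r_y1, r_y2 -- correlations of each predictor with the criterion
    r_12       -- correlation between the two predictors
    """
    if abs(r_12) >= 1:
        raise ValueError("predictors must not be perfectly correlated")
    return (r_y1 ** 2 + r_y2 ** 2 - 2 * r_y1 * r_y2 * r_12) / (1 - r_12 ** 2)


alone = 0.40 ** 2  # first predictor by itself

# Condition 1: both predictors relate to the criterion but not to each other.
independent = r_squared_two_predictors(0.40, 0.30, 0.00)

# Condition 2 (suppressor): the second predictor is unrelated to the
# criterion but correlated with the first predictor.
with_suppressor = r_squared_two_predictors(0.40, 0.00, 0.50)

# Contrast: a second predictor that correlates with the criterion only
# through its overlap with the first adds nothing.
redundant = r_squared_two_predictors(0.40, 0.30, 0.75)

for label, value in [("first predictor alone", alone),
                     ("plus independent predictor", independent),
                     ("plus suppressor", with_suppressor),
                     ("plus redundant predictor", redundant)]:
    print(f"{label}: R squared = {value:.3f}")
```

In this toy case the suppressor raises the variance accounted for although its own correlation with the criterion is zero, while the redundant predictor leaves it unchanged despite a correlation of .30 with the criterion.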


Dividing the Sample by Sex and Route 


Two previous studies (Gough et al., 1968; McClain, 1968) found indications that 
there are different personality configurations for successful male and female 
TABLE 1

MEANS AND STANDARD DEVIATIONS OF CRITERION AND
PREDICTOR VARIABLES FOR THE TOTAL SAMPLE (N = 141)

Variable                              Mean          Standard Deviation

Criterion
  Grade in Student Teaching           [illegible]   1.003
Biographical Predictors
  Sex                                 1.475         0.501
  Age                                 23.638        4.654
  Admission Grade Point Average       [illegible]   0.473
  Major Grade Point Average           2.829         0.516
  Route                               [illegible]   0.502
Psychological Test Score Predictors
  MTAI                                47.220        28.700
  D Scale                             101.972       25.291
  Q-Sort                              66.674        19.895
  A 16PF                              [illegible]   1.559
  B 16PF                              6.234         [illegible]
  C 16PF                              5.986         [illegible]
  E 16PF                              5.064         1.810
  F 16PF                              5.348         1.801
  G 16PF                              [illegible]   1.748
  H 16PF                              5.688         1.860
  I 16PF                              [illegible]   1.778
  L 16PF                              [illegible]   2.022
  M 16PF                              6.759         2.032
  N 16PF                              [illegible]   1.882
  O 16PF                              [illegible]   1.973
  Q1 16PF                             5.901         1.947
  Q2 16PF                             [illegible]   [illegible]
  Q3 16PF                             6.433         1.928
  Q4 16PF                             4.957         [illegible]
  TAT                                 [illegible]   [illegible]
  Col Eng                             60.355        [illegible]
  Con Quest                           [illegible]   [illegible]
  Do CPI                              53.589        10.970
  Cs CPI                              51.248        10.146
  Sy CPI                              51.865        9.540
  Sp CPI                              56.596        10.268
  Sa CPI                              56.936        10.287
  Wb CPI                              49.213        9.081
  Re CPI                              45.652        9.179
  So CPI                              48.716        10.088
  Sc CPI                              46.844        10.202
  To CPI                              50.270        9.338
  Gi CPI                              46.801        10.384
  Cm CPI                              [illegible]   7.448
  Ac CPI                              [illegible]   9.820
  Ai CPI                              55.589        [illegible]
  Ie CPI                              [illegible]   9.367
  Py CPI                              [illegible]   8.848
  Fx CPI                              55.844        [illegible]
  Fe CPI                              51.504        10.083
TABLE 2

CORRELATIONS BETWEEN PREDICTORS
AND GRADE IN STUDENT TEACHING, AND RESULTS OF MULTIPLE
REGRESSION ANALYSIS (N = 141)

                             Correlation with     Order of Entry      Multiple R
Predictors                   Grade in Student     into Regression
                             Teaching             Equation

Biographical Particulars
  Sex (Male)                 -.182*
  Admission GPA               .216*
  Major GPA                   .255**              (3)                 .432
Psychological Test Scores
  Q-Sort                      .243**
  F 16PF                      [illegible]         (4)                 .477
  O 16PF                     -.216*               (8)                 [illegible]
  Q, 16PF                     .267**              (2)                 [illegible]
  Do CPI                      .243**              (5)                 [illegible]
  Sy CPI                      .092                (9)                 .609
  Cm CPI                      .205*               (7)                 .566
  Ai CPI                     -.189*               (6)                 [illegible]
  Fx CPI                     -.275**              (1)                 .275

* p < .05
** p < .01
TABLE 3 
INTERCORRELATION MATRIX (N = 125) 
Variable Names    Grade in Student Teaching    Fx CPI    Ai CPI    Cm CPI    Sy CPI    Do CPI    Q 16PF    O 16PF    F 16PF    Q-Sort    Major GPA    Adm GPA
Sex (Male) -.182 156 -017.  -.028 .099 +101 -.120 =.160 -=.113 -181 -.147 -.168 
Adm GPA 216 = 055 +206 -O91 -050 =020 AE Si) =.003 So ES) - 102 -895 1.00 
Major GPA 1255 -.049 - 186 -.000 - 066 -028 «133 = 053 Le, 074 1.00 
Q-Sort +243 206 -081 264 eo, 279 -185 cS 16 3// =—,1093 1.00 
F 16PF celal - 006 = SAS 020 «295 +050 —2243 039 1.00 
O 16PF -.216 AOSPY  phiby -081 -.369 -.342 -.098 1.00 
Q; 16PF -267 O26 are de -.144 =.056 703.0 1.00 
Do CPI SPAY Tosa .100 .207 -606 1.00 
SyHCRL 092 -.028 050 .118 1.00 
Cm CPL .205 Sythe) -.047 1.00 
ALVCEE -.189 +438 1.00 
Bx CPE =. 2/95 1.00 
Note. For N = 125: .05 level of significance = .174
                   .01 level of significance = .228
student teachers, but Morris (1975) was unable to verify this finding in his study.
Morris also was unable to identify differences between good elementary and
secondary teachers and concluded that “the differences between good and poor
teacher personality is the same regardless of sex or grade level taught” (p. 20). With
a view to examining the discrepancies found in these previous studies, the
investigator made additional analyses by dividing the sample by sex, by grade level
taught, and by subject major or level of preparation. The following is a summary of
a more complete analysis, the results of which are available from the author upon
request.


Females Only (N = 74). Employing 44 variables (Sex excluded) as available 
predictors and applying the same criteria for consideration as in the previous 
analysis, a total of ten variables qualified for consideration in this section. Again, 
with the F value for inclusion set at 4.00, the regression equation produced a 
multiple correlation of .699 with a standard error of .76964. Six predictors entered 
the equation. The equation for predicting grade in student teaching was: 


Grade = 1.95426 + (.19797)Q, 16PF + (.83980)Major GPA
      + (.04419)Cm CPI + (-.04455)Ai CPI + (-.01184)D-Scale
      + (.02654)Re CPI                                              (2)


There were several differences between this equation and Equation 1 for the
total group. First, only four of the variables were common to both equations: Q,
16PF, Major GPA, Ai CPI, and Cm CPI. Notable by its absence from Equation 2
was Fx CPI, which entered on Step 1 of Equation 1. Also missing were F 16PF, Do
CPI, O 16PF, and Sy CPI. Two variables, D-Scale and Re CPI, entered Equation 2
but did not appear in Equation 1 for the total group.

Males Only (N = 67). Using the same predictors and criteria, none of the
biographical particulars and only five of the test scores qualified for analysis. The
equation produced a multiple correlation of .511 and a standard error of .86165
with only two variables entering:


Grade = 4.36615 + (.02105)Q-Sort + (-.0419)Py CPI                   (3)


Neither of these two variables appeared in the equations for the total group or for
females. Both were significantly correlated with the criterion (p < .05) but not
with each other.

Elementary Only (N = 69). When the regression equations for the elementary- 
secondary groups of subjects were examined, an interesting but perplexing finding 
appeared. Sex entered on Step 2 of the equation for predicting grade in student 
teaching for secondary level subjects. Because of the practical nature of this study 
and the obvious implications which would arise from the use of sex as a criterion 
for the selection of candidates for teacher education programs, sex was removed 
from the array of available predictors and the data were reprocessed with the re- 
maining 44 available predictors. 

Five variables qualified for consideration in the analysis of elementary-route 
students, but only four entered the regression equation producing a multiple 
correlation of .685 and a standard error of .77172. 


Grade = -1.28909 + (.27778)Q, 16PF + (.06249)Cm CPI
      + (.48064)Major GPA + (-.02439)Ie CPI                         (4)
Secondary Only (N = 70). With the same variables and criteria as for
elementary-route students, three variables entered the regression equation,
producing a multiple correlation of .528 and a standard error of .85689.


Grade = 3.66568 + (.01504)Q-Sort + (-.04999)Ai CPI + (.03515)Ie CPI (5)


Dividing the Sample Into More Specific Combinations 

Sex and Route. A further analysis was completed for each of
female-elementary, female-secondary, male-elementary, and male-secondary
students. Numbers in each group were relatively small; consequently it was
possible to account for a large portion of the variance in student teaching grade. The
multiple correlations and prediction equations produced were as follows:


Female-elementary: N = 49, R = .768, standard error = .73181

Grade = -.53328 + (.29696)Q, 16PF + (.07683)Cm CPI
      + (-.00955)D-Scale - (.03392)TAT                              (6)

Female-secondary: N = 24, R = .698, standard error = .54042

Grade = 28264 [decimal point illegible] + (-.06733)Sa CPI
      + (.99046)Major GPA                                           (7)

Male-elementary: N = 20, R = .596, standard error = .68224

Grade = 6.37080 + (-.05282)Py CPI                                   (8)

Male-secondary: N = 46, R = .570, standard error = .87867

Grade = 3.55811 + (.02643)Q-Sort + (-.08330)[illegible: E VOR]      (9)


Subject Major Subgroups. Pursuing one step further the hypothesis “that suc- 
cessful teaching in different fields calls for a personality unique to each field” 
(McClain, 1968, p. 31), three subject major subgroups — physical education, social 
studies, and English — were selected for analyses. These three subject areas had 
the largest numbers of students and had generally been oversubscribed in terms of 
available teaching personnel at the University of Lethbridge. Again, numbers were 
small and a large portion of the variance was accounted for. The regression equa- 
tions were as follows: 


Physical Education: N = 24, R = .750, standard error = .72174

Grade = 4.69066 + (1.38098)Major GPA + (-.06163)Py CPI
      + (-.26474)B 16PF                                             (10)


Social Studies: N = 32, R = .932, standard error = .40064

Grade = 3.04686 + (.05415)Q-Sort + (-.03806)Gi CPI + (-.05446)Sa CPI
      + (4.52233)Admission GPA + (-3.61880)Major GPA + (.14192)B 16PF
      + (-.04004)Ac CPI + (.12128)C 16PF                            (11)
English: N = 25, R = .900, standard error = .46941

Grade = -1.04184 + (.05282)Do CPI + (.18464)Q, 16PF + (.15745)L 16PF
      + (-.2567)E 16PF + (.23055)F 16PF                             (12)


Discussion 


The major purpose of this study was to determine which of the variables, if any, 
were worthy of further examination as predictors of performance in student 
teaching. In the next phase of this study the prediction equations will be 
cross-validated on another population of students by comparing their predicted 
grade in student teaching with their actual grade. 


From the findings of this study, it was apparent that no single variable by itself 
was useful as a predictor. The best single predictor for the total group was Fx CPI, 
which accounted for only 7% of the variance in student teaching grade. Even for 
certain subgroups where there was a fairly high correlation between the criterion 
and one predictor (e.g., Do CPI at .641 for English majors), the proportion of
variance accounted for was not high enough to be useful. 
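
The percentages quoted here follow from squaring the reported correlations. A brief check, using only values given in this report (the labels and function name are illustrative):

```python
def variance_explained(r):
    """Proportion of criterion variance accounted for by a correlation r."""
    return r ** 2


reported = {
    "Fx CPI alone, total group (r = -.275)": -0.275,
    "Do CPI alone, English majors (r = .641)": 0.641,
    "Equation 1, total group (R = .609)": 0.609,
}
shares = {label: variance_explained(r) for label, r in reported.items()}
for label, share in shares.items():
    print(f"{label}: {share:.1%}")
```

The three shares come to roughly 7.6, 41.1, and 37.1 percent. Even the strongest single predictor in a subgroup therefore leaves well over half of the variance in student teaching grade unexplained.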


Other interesting findings were certain “non-relationships.” For example,
neither the College English Placement Test score nor the MTAI was of any value
as a predictor of grade in student teaching for any of the groups in this study.
Several of the subscales on the 16PF and the CPI, which from their descriptions 
were expected to be useful predictors, did not enter any of the regressions. In other 
instances, two subscales with apparently conflicting descriptions were both signifi- 
cantly correlated with the criterion variable. These discrepancies can be accounted 
for by examining differences among groups. 

Another misleading finding which existed for the total group was a “masking” 
effect. An example of this was in the D-Scale (dogmatism). The correlation with 
the criterion for the total group was —.028, leading one to believe that dogmatism 
was randomly distributed with respect to grade in student teaching. However, for 
the female-elementary subgroup the correlation was —.340 (p < .05), and for the 
male-elementary subgroup the correlation was .535 (p < .05), thus producing a 
masking effect for the total group. A similar effect existed for other variables such 
as Major GPA. 
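
The masking effect is easy to reproduce with made-up numbers. In the sketch below, two invented subgroups show a negative and a positive relation between a predictor and grade; pooled, the relation vanishes. The scores are hypothetical and chosen only to make the arithmetic transparent.

```python
from math import sqrt


def pearson_r(xs, ys):
    """Pearson product-moment correlation of two equal-length sequences."""
    n = len(xs)
    mean_x, mean_y = sum(xs) / n, sum(ys) / n
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    sxx = sum((x - mean_x) ** 2 for x in xs)
    syy = sum((y - mean_y) ** 2 for y in ys)
    return sxy / sqrt(sxx * syy)


# Invented predictor scores (1 = low, 5 = high) and grades for two subgroups.
scores_a, grades_a = [1, 2, 3, 4, 5], [5, 3, 4, 1, 2]  # higher score, lower grade
scores_b, grades_b = [1, 2, 3, 4, 5], [1, 3, 2, 5, 4]  # higher score, higher grade

r_a = pearson_r(scores_a, grades_a)
r_b = pearson_r(scores_b, grades_b)
r_pooled = pearson_r(scores_a + scores_b, grades_a + grades_b)

print(f"subgroup A: r = {r_a:+.2f}")
print(f"subgroup B: r = {r_b:+.2f}")
print(f"pooled:     r = {r_pooled:+.2f}")
```

Each subgroup shows a strong relation (-.80 and +.80 in this toy example), yet the pooled correlation is zero, which is the pattern described above for the D-Scale.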

The findings of this study are to be regarded only as hypotheses to be tested
in the next stage of the project. Also, for those subgroups with a very small sample
size, the procedures employed may have capitalized on artifacts of the data.
Keeping these concerns in mind there appear to be two general findings worthy of 
note: 

1. The more specific one becomes in specifying the sex, grade level taught, and 
subject taught, the more accurate the prediction of grade in student teaching. 

2. To become more accurate in predicting grade in student teaching, one must in- 
clude as predictors psychological and personality test results in addition to 
biographical data. 
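
The caution about small subgroups can be given a rough size. The sketch below applies the common adjusted R-squared correction to three of the reported equations. This correction is an assumption introduced here, not a procedure used in the study, and because it ignores the stepwise selection from 44 or 45 candidate predictors it is likely to understate the true shrinkage; the planned cross-validation remains the proper test.

```python
def adjusted_r_squared(multiple_r, n, k):
    """Adjusted R squared for a multiple R based on n subjects and k predictors."""
    if n - k - 1 <= 0:
        raise ValueError("need more subjects than predictors plus one")
    return 1 - (1 - multiple_r ** 2) * (n - 1) / (n - k - 1)


# (label, multiple R, N, predictors in the equation), all as reported above.
equations = [
    ("Total group, Equation 1", 0.609, 141, 9),
    ("English, Equation 12", 0.900, 25, 5),
    ("Social Studies, Equation 11", 0.932, 32, 8),
]
adjusted = {label: adjusted_r_squared(r, n, k) for label, r, n, k in equations}
for label, r, n, k in equations:
    print(f"{label}: R squared = {r ** 2:.3f}, adjusted = {adjusted[label]:.3f}")
```

Under this correction the total-group figure drops from about .37 to about .33, and the two subject-major figures drop by roughly five percentage points each, before any allowance for the selection of predictors.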

A desirable feature of any selection procedure is that it employ the minimum 
number of predictors which would yield the highest possible prediction. The 
decision to include or exclude a certain variable in the practical application of a 
prediction model necessitates weighing the increase (or decrease) in predictive
power against the expenditures required to obtain the data for that variable. One
must also consider the ethical and moral implications of the “invasion of privacy
issue” raised when any tests are used for selection purposes. No attempt to deal
with such issues has been undertaken in this paper.³ All of the regression equations
reported in this study must be cross-validated on a subsequent group of subjects 
before more than tentative confidence can be placed in the results. However, the 
findings of this study indicate that further investigation in this area is definitely 
warranted. 


Notes 


1. To date there is no measure of the reliability or validity of the grade in student teaching. Other studies within the 
longitudinal project are examining those concerns. 


2. For expansion of this idea, see Wiggins (1973), Chapter 1, “The Model and Its Application.” 


3. The authors have created several hypothetical prediction models based on a rational inclusion or exclusion of various 
variables. 


The authors wish to acknowledge the personnel of the Faculty of Education Research Centre at the University of 
Lethbridge and the Research and Development Centre for Teacher Education, University of Texas at Austin for their 
assistance with the preparation of this report. 
A Critical Look at the WISC-R with
Native Indian Children 


The Wechsler Intelligence Scale for Children-Revised (WISC-R) was administered to 177
Native Indian children from three different bands in southwestern British Columbia. 
Item responses and difficulty gradients were evaluated and discussed for nine of the 
subtests. The raw score distributions on each subtest for ages six through twelve years 
were converted into scaled scores with a mean of ten and a standard deviation of three 
(comparable to Wechsler’s scaled scores for the original test). These scaled scores are 
available on request from the first author. A large number of items, especially from 
verbal subtests, were found not to contribute significantly to the total test variance. 
The implications of these findings are discussed in relation to clinical use of the 
WISC-R for Native Indian children. 


The clinician charged with the psychological assessment of ethnic or minority 
groups is frequently faced with the problem of employing intelligence tests which 
have been standardized on a representative sample of the general population. 
Although a particular ethnic group may be part of the standardization sample, the 
pattern or profile unique to that group is lost in the overall population variance. 
Thus, intra-group comparison, i.e., interpretation of the individual’s score with reference 
to his age and ethnic group, is rendered impossible from the norms 
published for the population at large. 


The reasons for differences between groups of different ethnic origin in 
performance on tests of general intelligence have been the topic of wide speculation 
but are not a major focus of the present study. A number of parameters affecting 
differential performance among cultural groups have been identified. They can be 
roughly divided into test taking attitudes such as motivation to succeed, test 
anxiety, cultural values, and test content such as differential familiarity with cer- 
tain tasks and particularly varying experience with verbal material in general. 


Differential test performance between majority population and certain ethnic 
groups has been partialled out or dealt with in several ways. First, the addition of a 
constant to an individual’s obtained score can be used to adjust for systematic 
differences from published norms (Droege, Botterbusch & Hawk, 1969). Second, 
ethnic and socioeconomic variables can be treated as moderators (Gaddes, 
McKenzie & Barnsley, 1968; Guion, 1966). Individuals are categorized on the basis 
of race, socioeconomic level, or other relevant factors with test scores and their 
interrelationships investigated within each group. A third approach involves 
constructing culture-fair tests with the objective of minimizing differences between 
ethnic and socioeconomic groups. The movement to develop culture-fair tests 
began in the 1950’s but did not resolve the clinician’s dilemma. To begin with, 
culture-fair tests vary considerably in terms of cognitive functions they purport to 
measure and further, they usually omit tasks related to school learning which are 
critical in the assessment of children. 

The described problems are particularly apparent in the assessment of 
indigenous persons. The performance of Native Indian children on most 
standardized intelligence tests indicates dull normal functioning characterized by 
average perceptual-motor skills and poor verbal skills (Goodenough, 1962; Peters, 
, 1963; Vernon, 1966; MacArthur, 1968, 1969). 


MacArthur (1969) gave group-tests to North West Territory Indian and Métis 
youths. Average or above average scores were demonstrated on the performance 
tests while low scores were attained on the verbal tests. Other investigators using a 
variety of tests with a number of Native Indian groups also found these children to 
be superior to their White counterparts on performance tasks and inferior on 
verbal tasks. 

Cazden and John (1971) cite several examples of superiority on tasks such as 
visual imagery, skills in interpreting photographs, learning through imitation, and 
the Draw-A-Man test. On the other hand, Gaddes, McKenzie and Barnsley (1968) 
found no statistical support for the hypothesis that North American Indian 
children possess superior spatial imagery as measured by a Wechsler Intelligence 
Scale for Children (WISC) subtest and the Porteus Maze Test. Brophy and Aberle 
(1966) suggest that the Native Indian concept of time, their disposition to conform 
to nature, and their withdrawal in school are three sources of value conflict which 
limit verbal abilities. It is also suggested that children who are reared in a culture 
which provides less experience of literacy behaviour and less awareness of the act of 
speech can be expected to experience greater problems on the verbal portion of 
intelligence tests as well as greater cognitive confusion about reading and writing 
(Downing, Ollila & Oliver, 1975). The problem is compounded by the presence of 
varying degrees of bilingualism in Native Indian children. 

A Canadian study by Taylor and Skanes (1976) compared Inuit and White 
children, matched for grade placement and environmental circumstances, on 
verbal-educational tasks, spatial abilities, and inductive reasoning. After only one 
year at school, no difference in verbal-educational tasks or inductive reasoning 
could be demonstrated. This finding, which emphasized the importance of 
environment on the development of abilities, has also been supported by Wilson 
(1973). In an early study, Rohrer (1942) hypothesized that if Native Indians could 
be matched with White children on socioeconomic status, cultural opportunities 
and comparable schooling, a difference in I.Q. scores would not be demonstrable. 
This hypothesis was confirmed. 

The Wechsler Intelligence Scale for Children-Revised (WISC-R) (Wechsler, 
1974) is a commonly accepted tool for evaluating general intelligence. However, 
relatively little research is currently available relating differential test responses to 
cultural groups. Such information regarding Native Indian children would be of 
value for a number of reasons. As a rule, psychologists consider not only total test 
and subtest performance, but often make clinical inference based on particular 
item responses. In addition, psychologists are interested in a scale which can reveal 
individual differences within a specific group. For these purposes it is necessary to 
know something of the discriminatory power of the individual items as well as the 
order of difficulty of items when the scale is applied to Native Indian children. 

The WISC-R manual cites correlational studies of the test with the Wechsler 
Preschool Primary Scale of Intelligence (WPPSI), the Wechsler Adult Intelligence 
Scale (WAIS), and the Stanford-Binet, Form L-M. The major difference between 
the original (1949) and revised form of the WISC is that the standardization 
sample for the latter includes non-Whites and is therefore more representative 
than the original test. Furthermore, the WISC-R has new administration and 
scoring criteria with an altered sequence of subtest presentation. Both forms of the 
test yield full scale, verbal, and performance I.Q.s, each with a mean of 100 and a standard 
deviation of 15. Only 10 subtests of the WISC and WISC-R are considered 
mandatory. The I.Q. is calculated on the basis of five verbal subtests (Information, 
Similarities, Arithmetic, Vocabulary, and Comprehension) and five performance 
subtests (Picture Completion, Picture Arrangement, Block Design, Object 
Assembly, and Coding). 

Few studies have examined item responses of the WISC for a given population. 
Items inappropriate for Canadians were identified in studies by Spreen and Tryk 
(1970) and Vernon (1976; 1977). WISC protocols were collected on ninety 
disadvantaged children from varying cultural backgrounds by Vance and Gaynor 
(1976). The full scale I.Q.’s ranged from 80 to 95. Analyses indicated that while 
item difficulties compared rather well with those obtained with the standardization 
sample, there did appear a need for rearrangement of some subtest items particu- 
larly on the Vocabulary and Picture Completion subtest. It was also evident with 
this population that a large proportion of the items did not contribute to the total 
test variance. 

The purpose of the present study was to gather information on the performance 
of Native Indian children on the WISC-R. It was felt that this information would 
facilitate more accurate and meaningful assessment of Native children by clinicians 
serving similar populations. Specifically, the study was designed to compile mean 
raw scores for Native Indian children on the WISC-R, to evaluate the item 
responses and difficulty gradient of the WISC-R from the sample of Native 
children tested, and to restandardize the scores on this sample, in keeping with Wechsler’s test 
construction, for nine subtests in the battery. 


Method 


The subject population included 177 Native Indian children (93 boys and 84 
girls) ranging in age from 6 to 13 years, representing three separate bands in 
Southwestern British Columbia; Native children lived on reserves and attended 
regular schools allowing for their integration into the wider community. The first 
band was represented by 95 children, the second by 14, and the third by 68. 

The data were gathered in the summers of 1975 and 1977 by two male 
examiners. The WISC-R was given to each subject in a quiet area of the school or 
band office. Rapport with the children was established and maintained by the 
examiners outside as well as inside the testing situation by their continued 
presence among the Native children. Both examiners were housed on or near the 
reserves with the cooperation of the Native people. The purpose of the study was 
explained to Band representatives and feedback was given after testing had been 
completed. In the administration and scoring of the WISC-R the manual was 
carefully followed. 


Results and Analyses 
Data Analysis 


Single item response frequency and difficulty gradient were analyzed by coding 
items as 0 for incorrect and 1 for correct responses; partial or qualitative scores 
were not recorded separately. Items were considered to have been correctly 
answered if any credit could be allowed. While this procedure is likely to reduce 
individual differences in item responses it was followed to increase ordinal consist- 
ency of the test items as well as to simplify the analyses. 
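
The dichotomous coding described above can be sketched in a few lines of Python. This is only an illustration under stated assumptions: the function name `dichotomize` and the sample credit values are hypothetical and do not come from the study.

```python
def dichotomize(credits):
    """Code each item as 1 if any credit was allowed, otherwise 0."""
    return [1 if credit > 0 else 0 for credit in credits]


# Hypothetical credits earned by one child on five items (0, 1, or 2 points).
raw_credits = [2, 1, 0, 1, 0]
coded = dichotomize(raw_credits)
print(coded)  # [1, 1, 0, 1, 0]
```

Partial credit (for example 1 of 2 points) and full credit both collapse to 1, which is why the procedure reduces individual differences in item responses.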

For the restandardization of subtests, the distribution of raw scores at each age 
level was computed and converted to a scale having a mean of 10 and a standard 
deviation of 3. Minor irregularities at the extreme ends of each scale were 
eliminated by smoothing. 
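
A linear version of this conversion might look like the sketch below. It is a simplification under stated assumptions: the raw scores are invented, the function name `to_scaled_scores` is hypothetical, and a plain z-score rescaling stands in for whatever distribution-based conversion and smoothing were actually applied.

```python
from statistics import mean, pstdev


def to_scaled_scores(raw_scores, target_mean=10.0, target_sd=3.0):
    """Linearly rescale one age group's raw scores to the target mean and SD."""
    m = mean(raw_scores)
    sd = pstdev(raw_scores)  # population SD of this age group
    if sd == 0:
        raise ValueError("all raw scores are identical; cannot rescale")
    return [target_mean + target_sd * (x - m) / sd for x in raw_scores]


# Hypothetical raw scores on one subtest at one age level.
raw = [3, 5, 6, 6, 8, 10, 11]
scaled = to_scaled_scores(raw)
print([round(s, 1) for s in scaled])
```

By construction the rescaled scores of the group have a mean of 10 and a standard deviation of 3, so a child at the group mean receives a scaled score of 10.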


Effects of Age and Group Membership 

An Age x Group (Band) analysis of variance (ANOVA) was carried out on full 
scale scores to examine the effects of these two factors. The five thirteen-year-old 
subjects were omitted from the analysis as this age level was not represented in two 
of the three groups. Significant main effects were demonstrated for both age (F [6, 
151] = 3.42, p < .001) and group (F [2, 151] = 3.86, p < .001). The Age x Group 
interaction was not found to be significant. The group effect may represent true 
differences between the three Native Bands; however, the size of one group was 
quite small (N = 14). For the purpose of subsequent analyses the data from all 
three groups were collapsed to supply a larger sample size. Table 1 presents the 
means, standard deviation and univariate tests of significance for each subtest 
across seven age levels. 


Subtest Item Validity and Difficulty 

TABLE 1 


F VALUES, MEAN SQUARES, AND P VALUES FOR AGE FACTOR 
ON WISC-R SUBTEST RAW SCORES 


Variable               F(7, 168)    Mean Sq.    p less than 
Information              20.28       165.42        0.00 
Picture Completion       19.61       278.27        0.00 
Similarities              9.56       171.81        0.00 
Picture Arrangement      14.09      1085.90        0.00 
Arithmetic               24.25       129.12        0.00 
Block Design             13.08      1231.08        0.00 
Vocabulary                  --       495.38        0.00 
Object Assembly          13.10       266.26        0.00 
Comprehension             7.30       252.02        0.00 


                                                              Age 
                           6             7             8             9            10            11            12 
                        (N = 21)      (N = 23)      (N = 30)      (N = 19)      (N = 31)      (N = 29)      (N = 20) 
                         M     SD      M     SD      M     SD      M     SD      M     SD      M     SD      M     SD 
Similarities            3.09   2.82   5.13   4.14   6.26   3.68   6.52   3.94   8.12   5.24  10.06   4.16  11.40   4.01 
Picture Arrangement     6.80   7.06  12.39   9.78     --     --     --     --     --     --  23.41  10.03  26.10   6.80 
Coding                    --     --  31.00     --  22.88   8.01  26.32     --  33.19     --     --     --     --   8.78 
Digit Span              4.00   2.26     --     --     --     --     --   2.25   9.40   2.59   8.50   2.47   8.42   1.13 
Mazes                  13.10   5.08  14.35   5.82     --     --  21.46   3.00     --   4.65     --   4.30     --     -- 

Note. -- = entry illegible in the source. The rows of means and standard deviations for Information, Picture 
Completion, Arithmetic, Block Design, Vocabulary, Object Assembly, and Comprehension are illegible throughout. 


Tables 2 to 6 present the results of the item analyses on each of the verbal 
subtests. These have been tabulated in three columns as follows: (a) frequency of 
subjects having answered the items correctly, (b) the proportion of subjects 
answering the item correctly, and (c) the point biserial correlation coefficient (rpb) 
between each item and the total weighted score (index of discrimination). Subtest 
items are presented in the tables in the same sequence as they appear on the 
WISC-R. Tabulation of the data from the performance subtests, with the exception 
of Coding, has been omitted in the interest of parsimony. 
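
The three statistics tabulated for each item can be computed as in the following sketch. It assumes dichotomous item scores and an unweighted subtest total for each subject; the function name `item_statistics` and the sample data are hypothetical, and the textbook point-biserial formula is used rather than any procedure confirmed by the authors.

```python
from math import sqrt
from statistics import mean, pstdev


def item_statistics(item_scores, total_scores):
    """Return (f, p, r_pb) for one dichotomously scored item.

    item_scores:  0/1 score per subject on the item
    total_scores: total score per subject on the subtest
    """
    n = len(item_scores)
    f = sum(item_scores)                 # frequency of correct responses
    p = f / n                            # proportion correct (index of difficulty)
    sd = pstdev(total_scores)
    if f == 0 or f == n or sd == 0:
        return f, p, 0.0                 # no variance; 0 is a convention of this sketch
    passed = [t for i, t in zip(item_scores, total_scores) if i == 1]
    failed = [t for i, t in zip(item_scores, total_scores) if i == 0]
    # Point-biserial correlation (index of discrimination).
    r_pb = (mean(passed) - mean(failed)) / sd * sqrt(p * (1 - p))
    return f, p, r_pb


# Hypothetical data for six subjects.
items = [1, 1, 1, 0, 0, 1]
totals = [12, 10, 9, 4, 5, 8]
f, p, r_pb = item_statistics(items, totals)
print(f, round(p, 2), round(r_pb, 2))
```

An item passed by nobody (or by everybody) has no variance, which is why items with frequencies of 0 show a discrimination index of .00 in the tables.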


Of the 167 subtest items tested, 157 were answered by at least one subject. 
From these 157 point biserial correlation coefficients, 95 are significantly different 
from 0 at the .05 level (approximately 60%). This moderate percentage of items 
showing significant validity is due in part to the limited age range sampled. In the 
distribution of item difficulties for 157 scorable items attempted by this 
population, 39 or approximately 25% of the items are within the difficulty range of 
.30 to .70 which is considered desirable from a test construction point of view 
(Anastasi, 1976). Sixty-eight of the 157 scorable items are in the extreme deciles, 
i.e., the upper and lower ten percentiles (50 of 104 verbal items and 18 of 53 
performance items). These items at the extreme deciles do not contribute 
significantly to the total test variance. 
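
The percentages in this paragraph can be checked with a short calculation. The counts are those reported in the text; the function `difficulty_band` and its treatment of the boundary values (.10, .30, .70, .90 counted as inside each band) are assumptions made for illustration.

```python
def difficulty_band(p):
    """Classify an item by its proportion correct (index of difficulty)."""
    if p <= 0.10 or p >= 0.90:
        return "extreme"        # upper or lower decile
    if 0.30 <= p <= 0.70:
        return "desirable"      # range cited from Anastasi (1976)
    return "intermediate"


# Counts reported in the text.
scorable = 157
significant = 95
desirable = 39
extreme = 50 + 18               # verbal + performance items

print(f"significant: {100 * significant / scorable:.1f}%")  # 60.5%
print(f"desirable:   {100 * desirable / scorable:.1f}%")    # 24.8%
print(f"extreme:     {100 * extreme / scorable:.1f}%")      # 43.3%
```

On these counts roughly 43 percent of the scorable items fall in the extreme deciles, compared with about 25 percent in the desirable range.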


TABLE 2 


FREQUENCY OF CORRECT RESPONSE, INDEX OF DIFFICULTY, AND 
INDEX OF DISCRIMINATION FOR ITEMS ON INFORMATION 


Item     f      p     rpb          Item     f      p     rpb 
  1    172    .97    .06            16      4    .02    .04 
  2    174    .98    .04            17      3    .01    .03 
  3     --    .96    .07            18     10    .05    .09 
  4    161    .91    .16            19      9    .05    .10 
  5    124    .70     --            20      5    .02    .05 
  6     --    .67    .50            21      3    .01    .03 
  7    110    .62    .65            22      2    .01    .02 
  8    110    .62    .62            23      0    .00    .00 
  9     98    .55    .47            24      1   .006    .01 
 10     98     --     --            25      2    .01    .02 
 11     63    .35    .29            26      1   .006    .01 
 12     26    .14    .29            27      1   .006     -- 
 13     33    .18     --            28      0    .00    .00 
 14     --    .10     --            29      0    .00    .00 
 15     18    .10    .20            30      0    .00    .00 

Note. -- = entry illegible in the source. 


TABLE 3 


FREQUENCY OF CORRECT RESPONSE, INDEX OF DIFFICULTY, 
AND INDEX OF DISCRIMINATION FOR ITEMS ON SIMILARITIES 


Item     f      p     rpb 
  1     --    .84    .19 
  2     --    .70    .50 
  3    104    .58    .42 
  4    123    .69    .46 
  5     --     --    .48 
  6    111     --     -- 
  7     95     --    .61 
  8     93     --    .66 
  9     74    .41     -- 
 10     --    .08     -- 
 11     12    .06     -- 
 12      9    .05     -- 
 13      5    .02    .05 
 14      4    .02    .04 
 15      6    .03    .06 
 16      3    .01    .03 
 17      1   .006     -- 

Note. -- = entry illegible in the source. 


Subtest Item Placement 

Information. The data shown in Table 2 indicate some inconsistency of item 
placement relative to the WISC-R format. Items 13, 18, and 19 appear to be 
misplaced in order of difficulty. Item 13 (stomach) was demonstrated to be easier 
than item 12 (America). Items 18 and 19 are easier than items 15 and 16 for this 
population. The American history content of items 15 and 16 may be responsible 
for this finding as was demonstrated by Vernon (1977) using the WISC-R with 
Canadian children. A substantial increase in difficulty was noted from item 11 
(seasons) through the remaining items of the test. 


TABLE 4 


FREQUENCY OF CORRECT RESPONSE, INDEX OF DIFFICULTY, AND 
INDEX OF DISCRIMINATION FOR ITEMS ON ARITHMETIC 


Item     f 
  1    170 
  2    171 
  3    164 
  4    147 
  5    157 
  6    156 
  7    141 
  8    177 
  9     99 
 10     -- 
 11     65 
 12     48 
 13     28 
 14     26 
 15      4 
 16      6 
 17      1 
 18      1 

Note. -- = entry illegible in the source. The columns for the index of difficulty and the index of 
discrimination are not preserved in the source. 


TABLE 5 


FREQUENCY OF CORRECT RESPONSE, INDEX OF DIFFICULTY, 
AND INDEX OF DISCRIMINATION FOR ITEMS ON VOCABULARY 


Item     f      p     rpb 
  1    165    .93    .14 
  2    167    .94    .12 
  3    167    .94    .12 
  4    162     --     -- 
  5    170     --    .06 
  6    161     --    .12 
  7    128     --    .29 
  8    134    .75    .40 
  9    142    .80    .29 
 10    126     --    .40 
 11    128    .72    .29 
 12     82    .46     -- 
 13     --    .40    .38 
 14     45    .25    .28 
 15     25    .14    .28 
 16     --    .10     -- 

Note. -- = entry illegible in the source. Entries for items 17 to 32 are not preserved in the source. 


Similarities. With the exception of item 3 (shirt-hat), placement according to 
difficulty was found to be consistent with the WISC-R. There was a sharp decline 
in correct responses after item 10 indicating that items following contributed little 
to the total subtest discriminative ability for this population. 

Arithmetic. The difficulty level of items in this subtest closely approximates 
the order appearing in the WISC-R manual. Only items 4 and 10 appear to be 
misplaced with respect to ascending order of difficulty. If item 4 were placed in 
second position it would not interfere with items 2 and 3 which require the child to 
cover up a number of trees. 


TABLE 6 


FREQUENCY OF CORRECT RESPONSE, INDEX OF DIFFICULTY, 
AND INDEX OF DISCRIMINATION FOR ITEMS ON COMPREHENSION 


Item     f      p     rpb 
  1    162     --     -- 
  2    152    .85     -- 
  3    154    .87    .24 
  4    152    .85    .09 
  5    138    .78     -- 
  6    124     --    .29 
  7     96     --     -- 
  8     --    .40     -- 
  9     63     --     -- 
 10     39     --    .10 
 11     24     --    .25 
 12     28     --     -- 
 13      7    .04     -- 
 14      8    .04    .09 
 15      7    .04    .07 
 16      6    .03    .06 
 17      0    .00    .00 

Note. -- = entry illegible in the source. 

Vocabulary. Item placement for this population is at variance with the 
arrangement found in the WISC-R manual, particularly for items 5, 9, 17 and 18. 
The following was found to be more accurate in terms of ascending order of 
difficulty for this population: 5, 2, 3, 1, 4, 6, 9, 8, 7, 11, 10, 12, 13, 17, 14, 18, 15 and 
16. An abrupt increase in difficulty from item 15 on was noted. 
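
The empirical ordering can be reproduced by sorting items on their frequency of correct response. The sketch below uses only the frequencies for items 1 to 12 that are legible in Table 5; the variable names are hypothetical, and breaking ties by original item position is an assumption that happens to match the order given above.

```python
# Frequencies of correct response for Vocabulary items 1-12 (Table 5).
frequency = {1: 165, 2: 167, 3: 167, 4: 162, 5: 170, 6: 161,
             7: 128, 8: 134, 9: 142, 10: 126, 11: 128, 12: 82}

# Easiest item first: sort by descending frequency. Python's sort is stable,
# so tied items (2 and 3, 7 and 11) keep their original WISC-R order.
empirical_order = sorted(frequency, key=lambda item: -frequency[item])
print(empirical_order)
# [5, 2, 3, 1, 4, 6, 9, 8, 7, 11, 10, 12]
```

The result agrees with the first twelve positions of the sequence reported for this population.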

Comprehension. Inspection of the data in Table 6 indicates an order of 
difficulty closely approximating the WISC-R manual. Exchange of items 2 and 3 
for this population would appear to be warranted. An abrupt shift in difficulty 
level is noted from item 6 to item 8. 

Picture Completion. The arrangement of items on this subtest suggests that 
items 9 (ladder) and 10 (dresser) are misplaced in difficulty for this sample. The 
range of scores for this subtest indicates, however, that the discriminatory power 
for the majority of items is quite strong. 

Picture Arrangement. For this sample, only item 4 is misplaced with respect to 
ascending order of difficulty. Because of the item’s high discriminatory power, the 
placement error is not serious. There is a gradual shift from items which appear to 
be quite easy to items which appear relatively difficult, indicating a good range of 
scores. Most items contribute to the total score. 

Block Design. The first item of this subtest appears to be misplaced in regard to 
difficulty for the present sample. It is possible that very young children have 
difficulty in following examples in this subtest. Also, a child’s score is affected by 
the orientation of blocks. The last two items contribute little to the discriminatory 
power of the subtest for this sample. 

Object Assembly. Placement of items in order of difficulty is not as important 
for this subtest since all designs are given. Almost all subjects achieve some level of 
success on all items. For this sample, item 2 (horse) was more difficult than the 
others. 


Coding. Since this subtest does not present an increasing item difficulty 
component, the results are presented in terms of frequency with which scores of 
Coding A (for 6- and 7-year-olds) and Coding B (for children 8 years and older) 
were demonstrated in this sample (Table 7). 


TABLE 7 
FREQUENCIES OF SCORES TO CODE A AND CODE B 


      Code A                             Code B 
Score   Frequency        Score   Frequency        Score   Frequency 
   8        1              --        1              37        6 
  --        1              --        1              38        4 
  13        1              14        1              39        5 
  15        2              16        1              40        6 
  --        1              17        3              41        3 
  18       --              18        1              42       -- 
  20        1              20        3              43        4 
  21       --              21        2              44        2 
  22        2              22        3              45        2 
  24        1              23        3              46        2 
  26        1              24        4              47        2 
  27        4              25        4              48        2 
  29        1              26        3              49        1 
  30        2              27        3              --        2 
  31        3              28        3              53        1 
  34        1              29        5              54        1 
  35        2              30        2              55        1 
  36        3              31        5              --        1 
  38       --              32        5              58        3 
  44       --              33        4              59        1 
  46        1              34        5              62        1 
  48       --              35        2              64        1 
                           36        8 

Note. -- = entry illegible in the source. 


Scaled Scores for Native Indian Children 


Scaled score equivalents were calculated on the raw score distribution for each 
subtest and for each age in the sample. In order to retain comparability with the 
traditional use of the test, a mean of 10 and a standard deviation of 3 were used for 
each subtest. The scaled scores for the subtests can thus be added as instructed by 
the Wechsler manual and the sums converted into I.Q. equivalents.¹ This 
procedure allows the average Native Indian child of a given age to obtain an I.Q. of 
100. For a quantitative interpretation of the relationship between I.Q., scaled 
scores, and deviation from the mean, see Wechsler, 1974, Chapter 3. 


Discussion 


The present study examined the performance of Native Indian children on the 
WISC-R. Previous findings were confirmed insofar as these children as a group displayed 
average perceptual motor skills and poor verbal skills. However, of greater 
significance for clinical interpretation was the large number of items which did 
not contribute significantly to the total test variance. A number of factors may be 
responsible for this phenomenon. First, as previously mentioned, the age range of 
children sampled (6 to 13 years) represents an attenuation of the full age range for 
which the test was constructed. Second, the items dealing with American history 
which were found to be invalid for the Canadian population by Vernon (1977) are 
no doubt equally inappropriate for the present sample. Vernon’s substitute items 
are therefore recommended for use with Canadian Native Indian children. Third, 
the Native Indian children in the present sample live on reserves where indigenous 
cultural values most likely prevail. Competitive aspects of life (particularly of an 
academic nature), orientation toward speed, accuracy, and verbal fluency play a 
minor role in Native Indian culture. Limited familiarity with the spoken and written word 
is compounded by a degree of bilingualism as the elders frequently converse in 
their native language. A further factor to be considered is the extent to which 
marginal socioeconomic status is evident among the Native Indian population. 
This variable was not formally evaluated in the present study; however, seasonal 
fishing is known to be the prime source of income for the Bands sampled. The val- 
ues and attitudes reflected on the verbal subtests of the WISC-R, particularly 
Information, Comprehension, and Vocabulary are still those of the White middle 
class population. 


The implications of the above findings are that clinicians using the WISC-R 
with Native Indian children should interpret their findings with extreme caution. 
The norms developed from the present sample are based on small numbers and a 
limited geographical region (Northern Vancouver Island). They should therefore 
be used with these factors and the previously discussed findings in mind. Because 
of the poor item validities reported, especially on verbal subtests, individual profile 
analysis as a tool of clinical inference should be avoided. 


This study has not solved the clinician’s dilemma in dealing with psychometric 
assessment of ethnic groups such as the Native Indian population. However, it has 
pointed out the dangers inherent in employing the WISC-R and using the norms 
developed on a large number of persons representative of the general population. 


Notes 


1. The raw scores and scaled score equivalents across all ages of the sample are available on request from the first author. 


This project has been funded by the Glendale Lodge Society, Victoria, British Columbia. The testing was carried out by 
V. Lahmer and R. Tucker. The authors are grateful to Ms. N. Nelson for many valuable discussions and suggestions. 


Cognitive Interactions in Science: 
Classroom Practices and some Prescriptions 
of Theories of Learning 


An experimental analysis of the cognitive interactions of 320 lessons taught by 54 
science teachers at 6 secondary grade levels is reported. Variance in classroom 
practices was explained in terms of a number of dimensions of cognitive teaching 
style. 

From a study of changes in these dimensions across the grade levels, it was 
concluded that observed practices were generally harmonious with the notion of a 
hierarchical progression from knowledge of facts and principles to applications and 
problem-solving activities. However, low levels of practical activity in the early 
secondary grades revealed considerable dissonance between classroom practices and 
the prescriptions of Piagetian theory. 


During the past decade classroom interaction analysis has become firmly 
established as a distinctive area of educational research. From rather chaotic 
beginnings, a systematic overview is now emerging of the types of studies which 
have been undertaken in science classrooms. 


Power (1977) provides an interesting map of the territory of science classroom 
interaction studies and he suggests a useful classification of studies, according to 
the selected aspects of classroom life upon which attention is focussed. Affective 
studies are those attempting to characterize the socio-emotional climate of the 
classroom, whereas managerial studies focus on organizational, procedural and 
disciplinary aspects of the classroom processes. Cognitive studies, which seek to de- 
scribe the intellectual transactions taking place in the classroom, form a third 
category of Power’s classification. 

In retrospect, it is perhaps unfortunate that the great majority of the early 
studies undertaken tended to focus rather narrowly on either the socio-emotional 
climate or on specific managerial issues. Many of these studies employed 
Flanderian-type observation systems (Flanders, 1970) or extensions and 
modifications of these systems. Of course it cannot be denied that effective class 
management and a healthy emotional climate are essential prerequisites to learning, 
but they are not sufficient. Given adequate teacher management skills and a 
reasonable rapport between the teacher and the class, it can be argued strongly 
that the effectiveness of science learning and teaching must depend ultimately on 
the quality of the intellectual transactions which occur, that is, upon the types of 
cognitive skills which students are able to practise in the classroom. 

Education, The University of Western Australia. Dr. Hacker’s current research interests include classroom interaction 


Surprisingly few empirical studies of the cognitive aspects of science teaching 
have been undertaken and there is a clear need for more descriptive and 
correlational studies to tell us how classrooms do vary and how these variations 
might relate to the cognitive development of students in science. 


This study aims, via an experimental analysis of classroom behaviours, to 
identify and quantify dimensions along which cognitive interactions vary from one 
classroom to the next. Also, by examining trends in variations across different 
grade levels, the study aims to provide a basis for the comparison of classroom 
practices with some prescriptions of theories of learning. 


A review of relevant educational and psychological theory provides a number of 
potentially useful distinctions between different kinds of cognitive behaviours, 
upon which an observational instrument might be built. For example, one 
dimension along which the cognitive behaviours found in science classrooms might 
vary is Guilford’s (1956) convergent/divergent dimension. Another distinction, 
provided by the work of Bloom (1956), Gagné (1970) and others, is the notion of a 
hierarchical progression from knowledge of simple facts and principles to applica- 
tions and problem solving activities. Flanders (1970) delineates teacher-dominated 
interactions and those interactions which are initiated by students. Armstrong 
(1910), Bruner (1960), Schwab (1962) and others differentiate between didactic 
and heuristic interactions and, in addition, the studies of Inhelder and Piaget 
(1958) emphasize the importance of a progression from concrete, experimental 
experience to more abstract, theoretical understanding. 


This review identifies a number of different dimensions along which the 
cognitive interactions of science lessons are likely to vary. It also suggests that an 
adequate description of classroom practices is more likely to result from the 
development of multidimensional profiles than from the use of vague and global 
dichotomies, such as “traditional” and “progressive” which have previously been 
used to describe teaching. 

It is clear that the selection or development of an observational instrument for 
this type of study merits careful consideration and Rosenshine and Furst (1973) 
provide a useful series of suggestions to aid the would-be classroom researcher in 
this decision-making process. Many of the earlier systems which were developed 
were rejected because they lacked any explicit or implicit theoretical or empirical 
base. Indeed many of these systems seem to have been developed from little more 
than the author’s fantasies about classroom life. Observational instruments based 
on one established educational or psychological theory were also considered to be 
unsuitable because of the restrictions which this framework would inevitably 
impose on the data which were subsequently recorded. Instead, as outlined above, a 
multidimensional instrument was sought, which would facilitate recording of a very 
broad range of those cognitive behaviours which might be expected to occur in 
science lessons. 


Before developing a new instrument and adding, perhaps needlessly, to the 
great array of systems which have been developed for observing classrooms, a 
search was undertaken to see if any existing system might be located which would 
serve the purposes of this inquiry. The main selection criteria would be, firstly, that 
the instrument should provide quantitative measures of the frequencies of a wide 
range of cognitive behaviours and, secondly, that the instrument should have been 
specifically developed to study science lessons. The latter criterion was introduced 
because general systems have often failed to cope adequately with the unique 
aspects of science lessons in a laboratory context. 


Eventually the Science Teaching Observation Schedule (STOS), developed by 
Eggleston, Galton and Jones (1975), was selected as being well-suited to the re- 
quirements of this study. The STOS instrument was developed initially from an 
empirical analysis of videotaped science lessons and was specifically designed to 
differentiate between different styles of cognitive interaction in the science 
classroom. It is concerned only with the cognitive dimensions of classroom 
interactions and does not seek to quantify either socio-emotional or managerial 
aspects of classroom communication. 


The design of the schedule is based on two explicit assumptions: firstly, that 
recent developments in science teaching accept the processes of science as an 
integral part of science learned in schools and, secondly, that student cognitive 
growth in science is determined largely by the opportunities which are provided in 
the classroom to practice the various intellectual skills which are associated with 
science and scientific inquiry. 

The 23 categories of the STOS instrument are shown in Table 1. To facilitate 
the recording of classroom events, observations are grouped according to those 
events which are initiated by the teacher and those events which are initiated or 
maintained by students. Teacher talk is divided into three major areas: teacher 
questions, statements, and directives; pupil activity is divided into two main areas: 
pupil seeks information or consults and pupils refer to the teacher. Each of these 
five main areas is subdivided into minor categories, the basic pattern being 
interactions which are associated with recall of facts and principles, applications to 
problem-solving, hypothesising and experimental procedures. One interesting 
feature of the teacher questioning categories is that the observer’s attention is 
focussed on the responses actually made by the pupils and those cognitive skills 
which they are practising. In this context, problem-solving is defined in a 
convergent sense where only one solution to the problem is acceptable to the 
teacher. 

This examination of the STOS categories shows that the instrument has the 
potential to provide simultaneous measures along a number of different 
dimensions of science classroom interactions. A speculative model, which relates to 
the assignment of STOS categories to three major dimensions of science teaching, 
has been developed by Eggleston et al. (1976). This model is shown in Figure 1 and 
gives the authors’ speculative assignment of the categories to three scales, 
represented as three orthogonal dimensions. 

It is apparent that categories a1-7, b1-4, and c1-4 must, by definition, be 
teacher-directed or dominated and, conversely, that categories d1-4 and e1-4 must be 
pupil-directed. However, the arguments given for assignment of categories along 
the practical-theoretical and heuristic/didactic dimensions are more speculative. 
By exploring underlying dimensionalities in observed classroom interactions, this 
study provides an opportunity to test whether the internal structure of classroom 
observations is adequately summarized by this model and, more specifically, to 
check the assignment of the STOS categories to the various dimensions. 

Figure 1. Speculative assignment of the STOS categories to a 3-dimensional model of science 
teaching. 


Method 
Sample 


The sample consisted of 54 teacher-class units from high schools across the 
three Maritime provinces of Canada. Nine teacher-class units for each of the grade 
levels 7 to 12 participated in the study and equal numbers of classes studying 
biology, chemistry, and physics topics, respectively, were included in this sample. 
Classes were generally unstreamed. At schools where classes were streamed at the 
higher grade levels, a class of middle ability in science was selected. 

Instrument 


The categories incorporated into the STOS instrument were described earlier. 
A sign system for recording data was used, with a time-sampling unit of three 
minutes. Each category was checked once if it occurred during this period. A user’s 
manual for the instrument and well developed training procedures for observers 
are available (Eggleston et al., 1975). 


Procedure 


Before the observational work was started, some pilot studies were carried out 
to see if any modifications to the STOS instrument would be necessary for use in 
Canadian schools. Observer reactions to the instrument were favourable and it was 
thought that valid distinctions between different cognitive teaching styles could be 
made with the standard instrument. 

Careful attention was given to the issue of observer reliability. The classroom 
observation work was preceded by a full week of intensive training in use of the 
STOS instrument. At the end of this week reliability trials were carried out 
according to procedures recommended by Eggleston et al. (1975). During 
observation periods in the schools further reliability tests were given and retraining 
sessions were held as required to maintain high levels of observer reliability. 

Contacts with schools were made via the superintendent of the school district. 
Principals were asked to forward the names of science staff who were willing to 
participate in the study. These teachers were asked to select six lessons which they 
considered to be typical of their teaching; visits to classes by different observers 
were randomized to avoid systematic observer errors in the results. 


Results 

For each teacher-class unit a percentage use figure was calculated for each of 
the 23 STOS categories. For example, if a certain teacher was observed for a total 
of 60 three-minute, time-sampling units and use of category a1 was recorded in 45 
of those units, then that teacher’s use of category a1 was recorded as 75 percent. 
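
The calculation described above can be sketched in a few lines of Python. The function name, the category code, and the representation of each three-minute unit as a set of category codes are assumptions made for illustration; they are not taken from the STOS manual.

```python
def percentage_use(units, category):
    """Percentage of time-sampling units in which `category` was checked.

    `units` is a list of sets; each set holds the category codes checked at
    least once during one three-minute unit (sign-system recording).
    """
    if not units:
        raise ValueError("at least one time-sampling unit is required")
    checked = sum(1 for unit in units if category in unit)
    return 100.0 * checked / len(units)


# Hypothetical record: 60 units, with category "a1" checked in 45 of them.
lesson_units = [{"a1", "b1"}] * 45 + [{"b1"}] * 15
print(percentage_use(lesson_units, "a1"))  # 75.0
```

Because a sign system checks a category at most once per unit, the figure is a proportion of units and not a count of individual events.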

To identify those underlying dimensions along which cognitive interactions do 
vary, a principal components analysis of this frequency data with a varimax 
rotation was carried out. Before undertaking this analysis, four infrequently used 
STOS behaviours (a4, a7, c3, and d3) were discarded and the remaining 19 variables 
were subjected to a log transformation. This treatment was given to ensure that the 
distributions of input data for the factor analysis met the requirements of 
approximating to nonsingular multinormality. 

Five factors with eigenvalues in excess of unity were extracted from this 
analysis. These factors are described below. The varimax rotated factor loadings 
for the 19 STOS behaviours are shown in Table 1. 
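
The retention rule used here (keep components whose eigenvalues exceed unity) can be illustrated with a minimal sketch. It assumes a small, made-up correlation matrix rather than the study's data, uses a plain Jacobi rotation routine written for this example, and does not attempt the varimax rotation step.

```python
import math


def symmetric_eigenvalues(matrix, sweeps=50, tol=1e-12):
    """Eigenvalues of a symmetric matrix by cyclic Jacobi rotations."""
    a = [list(row) for row in matrix]
    n = len(a)
    for _ in range(sweeps):
        off_diagonal = sum(a[i][j] ** 2 for i in range(n) for j in range(n) if i != j)
        if off_diagonal < tol:
            break
        for p in range(n - 1):
            for q in range(p + 1, n):
                if abs(a[p][q]) < 1e-15:
                    continue
                # Rotation angle that zeroes the (p, q) element.
                theta = 0.5 * math.atan2(2.0 * a[p][q], a[q][q] - a[p][p])
                c, s = math.cos(theta), math.sin(theta)
                for k in range(n):  # update columns p and q
                    akp, akq = a[k][p], a[k][q]
                    a[k][p], a[k][q] = c * akp - s * akq, s * akp + c * akq
                for k in range(n):  # update rows p and q
                    apk, aqk = a[p][k], a[q][k]
                    a[p][k], a[q][k] = c * apk - s * aqk, s * apk + c * aqk
    return sorted((a[i][i] for i in range(n)), reverse=True)


def components_to_retain(correlation_matrix):
    """Number of principal components with an eigenvalue greater than 1."""
    return sum(1 for value in symmetric_eigenvalues(correlation_matrix) if value > 1.0)


# Hypothetical 3 x 3 correlation matrix (not data from the study).
example = [[1.0, 0.6, 0.5],
           [0.6, 1.0, 0.4],
           [0.5, 0.4, 1.0]]
print(components_to_retain(example))
```

For a correlation matrix the eigenvalues sum to the number of variables, so an eigenvalue above unity marks a component that accounts for more variance than any single standardized variable.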

Factor I provides a measure along the knowledge/applications dimension of science 
teaching. It is a “problem-solving” factor which relates to a shift in emphasis in the 
classroom from knowledge of facts and principles to applications and 
problem-solving activities. 

Factor II relates to the familiar practical/theoretical dimension of science teaching. 
It is a “practical” factor which provides a measure of the amount of class time spent 
on demonstrations and class experiments. 

Factor III is concerned with the didactic/heuristic dimension of science teaching. It 
is a “heuristic” factor which relates to the degree to which science is presented as a 
speculative and exploratory activity. 

Factor IV appears to relate to the teacher-dominated/pupil-initiated dimension, 
concerning the degree to which the initiative is held by the teacher in the 
classroom. 

Factor V is a teacher-questioning factor relating to the development of 
observational skills in pupils. 


TABLE 1 

STOS CATEGORIES AND VARIMAX ROTATED FACTOR LOADINGS 
FOR THE SAMPLE OF SCIENCE TEACHERS (N = 54) 

[Factor column headings and column alignment are not recoverable; loadings are listed as they appear in the source.] 

1. TEACHER TALK 

1-a Teacher asks questions (or invites comments) which are answered by: 
      a1  recalling facts and principles                          -33 
      a2  applying facts and principles to problem solving        +70 
      a3  making hypothesis or speculation                        +79 
    * a4  designing of experimental procedure 
      a5  direct observation                                      +61  +85 
      a6  interpretation of observed or recorded data             +67 
    * a7  making inferences from observations or data 

1-b Teacher makes statements: 
      b1  of fact and principle                                   -27  +73 
      b2  of problems                                             +86 
      b3  of hypothesis or speculation                            +86 
      b4  of experimental procedure                               +90 

1-c Teacher directs pupils to sources of information for the purpose of: 
      c1  acquiring or confirming facts or principles              31  +79 
      c2  identifying or solving problems                         +87 
    * c3  making inferences, formulating or testing hypotheses 
      c4  seeking guidance on experimental procedure              +85 

2. TALK AND ACTIVITY INITIATED AND/OR MAINTAINED BY PUPILS 

2-d Pupils seek information or consult for the purpose of: 
      d1  acquiring or confirming facts or principles             -90 
      d2  identifying or solving problems 
    * d3  making inferences, formulating or testing hypotheses 
      d4  seeking guidance on experimental procedure 

2-e Pupils refer to teacher for the purpose of: 
      e1  acquiring or confirming facts or principles 
      e2  seeking guidance when identifying or solving problems 
      e3  seeking guidance when inferring, formulating or testing hypotheses 
      e4  seeking guidance on experimental procedure 

% of variance 

* Infrequently used categories which were excluded from the factor analysis. 


For each teacher, a score for each of these 5 factors was calculated. These 
factor-scores were calculated from a direct summation of the logs of the percentage 
use figures rather than from a weighting of factor-score coefficients. 
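
A minimal sketch of this unweighted scoring follows. Several details are assumptions, since the text does not state them: base-10 logarithms are used, every percentage is taken to be greater than zero, and the category grouping in the example is hypothetical rather than the grouping actually used for any factor.

```python
import math


def factor_score(percent_use, categories):
    """Sum of the logs of the percentage-use figures for the given categories."""
    return sum(math.log10(percent_use[c]) for c in categories)


# Hypothetical figures for one teacher; the grouping is illustrative only.
teacher = {"a2": 10.0, "b2": 100.0, "c2": 1.0}
print(factor_score(teacher, ["a2", "b2", "c2"]))  # 3.0
```

Summing logs gives every contributing category equal weight, which is the contrast the text draws with scores weighted by factor-score coefficients.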


A two-way multivariate analysis of variance of these data was carried out with 
subject areas (biology, chemistry, and physics) and grade levels (7 to 12) as 
independent variables. Previous research work (Eggleston et al., 1976) had pointed 
towards significant differences between biological and physical science (chemistry 
and physics) lessons. Consequently, Helmert contrasts were applied to levels of the 
subject area variable. Orthogonal polynomial contrasts were applied to levels of the 
grade level variable. The results of this analysis are provided in Table 2. Figure 2 
shows mean Factor I scores for biological and physical science lessons at different 
grade levels. Figure 3 shows trends in Factors I, II and IV across the grade levels for 
all science lessons. In addition, the percentage of teacher-dominated interactions 
(categories a1-7, b1-4, and c1-4) was calculated for each teacher. Mean values across 
the grade levels are shown in Figure 4. 


TABLE 2 
SUMMARY OF THE MULTIVARIATE ANALYSIS OF VARIANCE DATA 

Source                              Multivariate F   Factor   Univariate F 

Subject Areas 
  Biology v (Chemistry + Physics)       9.77***       I        48.81*** 
                                                      II         .49 
                                                      III       1.97 
                                                      IV       [illegible] 
                                                      V        [illegible] 
  Chemistry v Physics                   1.89          I        [illegible] 
                                                      II        2.03 
                                                      III      [illegible] 
                                                      IV       [illegible] 
                                                      V        [illegible] 

Grade Levels 
  Linear                               20.61***       I        34.98*** 
                                                      II       10.02** 
                                                      III      [illegible] 
                                                      IV       39.60*** 
                                                      V         3.48 
  Quadratic                             3.00*         I         1.40 
                                                      II        5.03* 
                                                      III      [illegible] 
                                                      IV        6.54* 
                                                      V         0.08 
  Cubic                                 6.94***       I         5.01* 
                                                      II        6.34* 
                                                      III      [illegible] 
                                                      IV       22.41*** 
                                                      V        [illegible] 
  Others                                 -            -          - 

Interaction                              -            -          - 

*   p < .05 
**  p < .01 
*** p < .001 
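
The two sets of contrasts named above can be written out explicitly. The sketch below is illustrative only: the Helmert coefficients follow the comparisons listed in Table 2 (biology against chemistry and physics combined, then chemistry against physics), and the polynomial coefficients are generated by Gram-Schmidt orthogonalization, assuming the six grade levels are equally spaced. The function names are hypothetical.

```python
from fractions import Fraction
from functools import reduce
from math import gcd


def dot(u, v):
    return sum(a * b for a, b in zip(u, v))


def to_smallest_integers(vector):
    """Rescale a vector of Fractions to the smallest equivalent integers."""
    common = reduce(lambda m, f: m * f.denominator // gcd(m, f.denominator), vector, 1)
    ints = [int(f * common) for f in vector]
    divisor = reduce(gcd, (abs(i) for i in ints))
    return [i // divisor for i in ints]


def orthogonal_polynomial_contrasts(n_levels, max_degree):
    """Contrast coefficients for equally spaced levels, degree 1 to max_degree."""
    xs = [Fraction(i) for i in range(n_levels)]
    basis = [[Fraction(1)] * n_levels]          # constant term
    for degree in range(1, max_degree + 1):
        vec = [x ** degree for x in xs]
        for b in basis:                          # remove lower-degree components
            coef = dot(vec, b) / dot(b, b)
            vec = [v - coef * bi for v, bi in zip(vec, b)]
        basis.append(vec)
    return [to_smallest_integers(v) for v in basis[1:]]


# Level order assumed: biology, chemistry, physics.
HELMERT = [[2, -1, -1],   # biology v (chemistry + physics)
           [0, 1, -1]]    # chemistry v physics

linear, quadratic, cubic = orthogonal_polynomial_contrasts(6, 3)
print(linear)     # [-5, -3, -1, 1, 3, 5]
print(quadratic)  # [5, -1, -4, -4, -1, 5]
print(cubic)      # [-5, 7, 4, -4, -7, 5]
```

Each contrast sums to zero and is orthogonal to the others in its set, which is why the linear, quadratic, and cubic trends can be tested separately.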


Discussion 


The source variables which emerged from the principal components analysis 
were readily interpreted in terms of recognizable and familiar dimensions of 
cognitive style, providing encouraging support for the speculative model proposed 
by Eggleston et al. (1976). 

Factors II, III and IV were directly related to the three main dimensions of the 
model, accounting for almost 40 percent of the variance between teachers. In 
addition, Factor I, relating to a shift in emphasis from knowledge of facts and principles 
to applications and problem-solving activities, accounted for more than a quarter 
of the variance in cognitive style. This fourth dimension is proposed as a valuable 
extension to the basic model. 

Figure 2. Mean Factor I scores for biological and physical science lessons across the grade 
levels. (Legend: -O- Physical Science Lessons; -A- Biological Science Lessons.) 

Figure 3. Trends in Factors I, II, and IV across the grade levels for all science lessons. 

Figure 4. Mean percentages of teacher-dominated interactions across the grade levels for all 
science lessons. 


With regard to the three dimensions of the basic model, some minor relocations 
of categories along the axes might be suggested. In particular, the association of 
category a6 with the practical end of the practical/theoretical axis might be 
questioned. Questions answered by interpretation of observed or recorded data 
would seem to be more closely linked with heuristic teaching strategies. Indeed, 
observers commented that, during most practical lessons, teacher questions 
answered by observation were more usually followed by a teacher statement of fact 
or principle (category b1) rather than an invitation to interpret these observations. 
Also the association of categories b, and c, with heuristic teaching strategies may 
be weaker than the speculative model suggests. 


A further interesting point is that questions answered by direct observation 
would seem to occur in two different contexts. The first is the more common 
association with experimental work in a laboratory, while the second context, 
which probably accounts for the emergence of Factor V, is in association with 
materials, samples, or photographs brought to the lesson by students or teachers. 


Some evidence for the validity of the interpretation of Factor IV as a measure 
of teacher dominance was provided by the plot of the more familiar Flanderian 
index given in Figure 4, insofar as variations in this index and Factor IV across the 
grade levels were quite similar. 


The multivariate analysis of variance data (see Table 2) confirmed that one 
dimension varied significantly with the science discipline being taught. Biology lessons 
were differentiated from physical science lessons by differences in Factor I scores. 
As shown in Figure 2, at all grade levels, lower scores for biology lessons point 
towards a greater emphasis on facts and principles, whereas in physical science 
lessons, higher scores indicate more emphasis on applications and problem-solving 
activities. 

Across the grade levels, significant trends in Factors I, II and IV were found 
(see Table 2 and Figure 3). It is interesting to note that no significant variations in 
Factor III were found, either with subject taught or grade level. The degree to 
which science is presented as a speculative activity would seem to be more closely 
linked with the individual teacher’s approach rather than with these discipline or 
pupil variables. 


For all science subject areas, Factor I scores increased across the grade levels, 
with a reversal of this trend, particularly in biology classes, at the grade 12 level. 
The general trend through the grade levels is consistent with Gagné’s (1970) 
notion of cognitive development in science following a hierarchical progression 
from knowledge of simple facts and principles through to applications of these 
facts and principles to problem-solving activities. 


The trend in Factor II (see Figure 3) revealed considerable dissonance between 
classroom practices and the prescriptions of Piagetian theory. At the early 
secondary grade levels very little practical work is undertaken, yet Piaget’s 
observations and classifications of child development clearly indicate the need for 
concrete, practical experience at this time to encourage and facilitate learning. The 
general trend is towards increased amounts of experimental work at the higher 
grade levels. This trend is also reversed during the final year of schooling. 

From a theoretical perspective, the picture of science education at the early 
secondary grade levels which emerges from this study is a disturbing one. High 
levels of teacher-dominance and low levels of practical activity in the early years 
are thought not to provide a sound basis for cognitive development in science. 
Indeed, there is substantial evidence (Duckworth, 1964; Green, 1971) to suggest 
that deprivation of practical experience at this stage can actually retard further 
cognitive growth. 


In conclusion, the sharp reversal of all observed trends during the final year of 
secondary schooling was an unexpected finding. These changes reflect a return to 
teacher-directed teaching strategies with renewed emphasis on knowledge of basic 
facts and principles rather than applications to problem-solving activities. 
Lundgren (1972) has proposed a model of teaching whereby it is viewed as a 
process governed by “frames” or externally imposed constraints which determine 
what is possible or not possible in the classroom. It seems likely that the changes 
observed during the final year may be linked with external factors, such as 
teachers’ perceived needs to complete a course of study, but further research 
activity is obviously needed to establish reasons for these variations in patterns of 
classroom interactions. 


The Validity of the Estes Reading Attitude Scale 
for Intermediate Grades 


Affective development is being increasingly emphasized in learning, but problems 
arise in obtaining good measures of attitude toward school subjects, including 
reading. This study examines the internal consistency reliability and convergent validity 
of the widely used Estes Reading Attitude Scale (ERAS) for intermediate grades. 
Subjects were 1,403 children in 65 intact classrooms in nine elementary schools in 
Richmond, British Columbia. The Metropolitan Achievement Test — Reading (MAT), 
the ERAS, and two subjective measures, the Student Reading Rating Scale (SRRS) 
and the Teacher Reading Rating Scale (TRRS), were administered to all subjects. 
Results indicate good internal consistency reliability for the ERAS and solid 
convergent validity as indicated by the magnitude and direction of the correlations 
with the SRRS, the TRRS, and the MAT. The ERAS provides a useful global measure 
of school-oriented attitude toward reading in intermediate grades. 


Affective development is being increasingly emphasized as an educational 
objective. In their extensive review of a decade of research on the teaching of 
affective responses, Khan and Weiss (1973) concluded that “While the first wave of 
new curricula was concerned almost exclusively with cognitive outcomes, the most 
recent programs have been placing more emphasis on affective behaviours” 
(p. 788). 


The Taxonomy of Educational Objectives: Affective Domain (Krathwohl, 
Bloom & Masia, 1964) defines the affective domain as subsuming such traits as 
temperament, attitude, belief, interest, or value. Attitudes, beliefs, values, and 
interests are important outcomes of education. Shaw and Wright (1967) point out 
that “If the attitude of a person toward a given object, or class of objects, is known, 
it can be used in conjunction with situational and other dispositional variables to 
predict and explain reactions of the person to the class of objects” (p. 2). Indeed, 
with respect to reading development, Athey (1976) suggests that “by increasing our 
knowledge of the affective life of the child, we broaden our understanding of his 
total functioning in the academic situation. In other words, the intellectual 
variables involved in reading do not operate in isolation but are modified by the 
individual’s attitudinal and personality characteristics” (p. 352). 


Increased concern over the academic achievement of public school children 
necessitates a careful examination of potentially relevant factors which contribute 
to success in school. Since reading is the basis for achievement in most school 
subjects, it seems logical that if a child develops a positive attitude toward reading, 
the attitude may well carry over to other subjects. Questions are more frequently 
being asked about the affective parameters of reading, viz: Do certain practices, 
programs and materials in reading hinder or facilitate a positive attitude toward 
reading? Do children who can read, but do not, need remediation? How much do 
children read and why do some children read while others do not? Do all good 
readers have positive attitudes toward reading and/or vice versa? Answers to these 
questions hinge on estimating attitudes toward reading, an area of measurement 
which has been characterized by meager instrumentation to date in spite of its 
purported importance. Although attempts have been made to taxonomize affective 
objectives (Krathwohl et al., 1964), measurement in this area, in general, is still in 
its infancy in comparison to cognitive assessment and much remains to be done in 
defining the nature of affect, measuring it, and delineating its role in the education 
process (Khan & Weiss, 1973; Gephart, Engle & Marshall, 1978). A number of 
instruments have been developed in reading, but further work is needed to provide 
a solid base for assessing reading attitude (Alexander & Fillmore, 1976; Ewing, 
1977; Summers, 1977; Reed, 1978). 

The Estes Scales to Measure Attitude Toward School Subjects (Estes, 1971; 
Estes, Johnstone & Richards, 1975) are currently widely used in school research. 
The scales were developed to assess attitude in five content areas: English, 
mathematics, science, reading, and social studies. Validation studies thus far have 
shown promising results for the reading scales using subjects in grades seven 
through twelve (Estes & Johnstone, 1974; Estes et al., 1975) and grade eleven 
(Dulin & Chester, 1974). The purpose of this study is to present additional data on 
the internal consistency reliability and convergent validity of the Estes Reading 
Attitude Scale (ERAS) for a large population of intermediate grade subjects 
(Grades 5, 6, and 7). 


Internal Consistency 

The ERAS is a unidimensional 15-item instrument. The method of summed 
ratings is used in which responses are solicited to the battery of items using a 
five-point Likert scale with forward and reverse polarity. Subjects choose the 
alternative that best describes their reaction to each item. Attitude toward reading 
is defined as a liking for or a dislike of reading. The 15 items of ERAS are: 
1. Reading is for learning but not for enjoyment. 
2. Spending allowance on books is a waste of good money. 
3. Reading is a good way to spend spare time. 
4. Books are a bore. 
5. Watching T.V. is better than reading. 
6. Reading is rewarding to me. 
7. Books aren’t usually good enough to finish. 
8. Reading becomes boring after about an hour. 
9. Most books are too long and dull. 
10. There are many books which I hope to read. 
11. Books should only be read when they are assigned. 
12. Reading is something I can do without. 
13. Some part of summer vacation should be set aside for reading. 
14. Books make good presents. 
15. Reading is dull. 
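
The method of summed ratings with forward and reverse polarity can be sketched as follows. The coding is an assumption, since the text does not give it: responses run from 1 (strongly disagree) to 5 (strongly agree), and items 3, 6, 10, 13, and 14 are treated as the positively worded items on the basis of their wording alone.

```python
POSITIVE_ITEMS = {3, 6, 10, 13, 14}  # assumption: inferred from the item wording


def eras_total(responses):
    """Summed score from 15 responses coded 1 (strongly disagree) to 5 (strongly agree)."""
    if len(responses) != 15 or any(r not in range(1, 6) for r in responses):
        raise ValueError("expected 15 responses, each an integer from 1 to 5")
    total = 0
    for item_number, response in enumerate(responses, start=1):
        # Negatively worded items are reverse-scored so that a high total
        # always indicates a more favourable attitude toward reading.
        total += response if item_number in POSITIVE_ITEMS else 6 - response
    return total


# Most favourable pattern: agree strongly with positive items, disagree strongly with the rest.
most_favourable = [5 if i in POSITIVE_ITEMS else 1 for i in range(1, 16)]
print(eras_total(most_favourable))  # 75
```

Under this coding totals range from 15 to 75, with 45 as the neutral midpoint.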

Subjects were 1,403 intermediate-grade children in 65 intact classrooms in nine 
elementary schools in Richmond, British Columbia (Grade 5, n = 448; Grade 6, 
n = 453; Grade 7, n = 502). All data were collected in early December of 1977. The 
Metropolitan Achievement Test — Reading (Durost, Bixler, Wrightstone, 
Prescott & Balow, 1970-71) was used to determine the reading characteristics of 
the intermediate subjects. The mean raw scores for total reading on the 
Intermediate and Advanced forms placed the three grade groups at the 68th, 58th, 
and 50th percentiles in reading achievement. The somewhat above-average 
percentile ranks for the three grades are typical of obtained scores when beginning 
year norms are used to interpret mid-year achievement test data. On a second 
administration of the MAT in June, 1978, the same groups achieved at the 58th, 
50th, and 50th percentiles respectively. The percentile score results and the plotted 
histograms for the three groups suggest a normal distribution for the subjects on 
the norms for the test with a slight negative skewing of scores. 

Hoyt reliability estimates (Nelson, 1974) were used to determine the internal 
consistency of the ERAS. (It should be noted that the Hoyt estimates are equivalent 
to Cronbach’s [1951] alpha.) Internal consistency, an aspect of reliability 
sometimes referred to as homogeneity, depends on the number of items and on the 
average Pearson correlation between all pairs of items. Table 1 presents the means, standard 
deviations, standard errors, and internal consistency coefficients by grade and 
gender for the ERAS. The internal consistency coefficients are all beyond .80, which 
is very high indeed for an affective instrument (Anastasi, 1976) and compares 
favourably with reliabilities obtained for many cognitive measures. Internal 
consistency coefficients are also beyond .80 for all three grades, thus supporting the 
construct homogeneity of the ERAS. Reliability greatly influences the size of the 
Standard Error of Measurement (SEm); tests with high reliability have correspondingly 
small SEm’s. The SEm’s are relatively small and quite consistent for the three 
grade levels, thus providing further evidence of good reliability for the ERAS. 


TABLE 1 

ERAS MEANS, STANDARD DEVIATIONS, STANDARD ERRORS, AND HOYT 
INTERNAL CONSISTENCY MEASURES BY GRADE AND GENDER 

                                    standard      standard      consistency 
                   n      mean      deviation     error         coefficient 
Grade 5           448    56.07     [illegible]   [illegible]       .86 
Grade 6           453   [illegible]  10.19          3.62           .87 
Grade 7           502    55.96       10.38       [illegible]       .89 
Grade 5  male     227    53.53     [illegible]   [illegible]       .86 
         female   221    58.68        9.41          3.39           .86 
Grade 6  male     223   [illegible]  10.09          3.84           .84 
         female   230   [illegible] [illegible]  [illegible]       .88 
Grade 7  male     241    53.71     [illegible]      3.49           .88 
         female   261    58.03        9.73       [illegible]       .88 
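
The two quantities discussed above can be computed as in the following sketch. It uses the standard formulas for Cronbach's alpha and for the standard error of measurement; the function names and the small data set are hypothetical, and results computed from the rounded coefficients in Table 1 will differ slightly from the tabled standard errors.

```python
import math
from statistics import variance


def cronbach_alpha(scores):
    """Cronbach's alpha for a list of respondents, each a list of item scores."""
    k = len(scores[0])
    item_variances = [variance([row[i] for row in scores]) for i in range(k)]
    total_variance = variance([sum(row) for row in scores])
    return (k / (k - 1)) * (1.0 - sum(item_variances) / total_variance)


def standard_error_of_measurement(sd, reliability):
    """SEm = SD * sqrt(1 - reliability)."""
    return sd * math.sqrt(1.0 - reliability)


# Hypothetical responses: four respondents, three items.
responses = [[1, 2, 1], [2, 2, 3], [4, 3, 4], [5, 5, 4]]
print(round(cronbach_alpha(responses), 2))
print(round(standard_error_of_measurement(10.0, 0.84), 2))
```

The second function shows why a high reliability coefficient implies a small SEm: as reliability approaches 1, the square-root term approaches zero.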


The item intercorrelation matrices (Table 2) for the three grades reveal typical 
patterns for a unidimensional instrument. All correlations of single items with total 
scores exceed .40, evidence that each specific item does a good job of measuring the 
construct of reading attitude (Ebel, 1972). In addition, as expected in a good 
unidimensional instrument, there are relatively low correlations between the 
separate items in the test. 


Convergent Validity 


Convergent (or criterion) validity can be defined as the extent to which test 
performance is related to some other valued measure of performance. Convergent 
validity is most often estimated by correlating test scores with alternative measures 
of the same construct. In this study, two informal instruments were developed to 
provide criterion measures: the Student Reading Rating Scale (SRRS) and the 
Teacher Reading Rating Scale (TRRS). Both measures were developed using an 
adaptation of the format reported by Dulin and Chester (1974). On the SRRS 
children are asked to rate themselves, in comparison with their peers, on three 
questions using a five-point Likert-type scale: (1) About how Well do you read? (2) 
How well do you Like to read? (3) About how Much do you read? The range of 
summed scores for the scale is 3 to 15 points with a midpoint of 9. The TRRS is 
similar to the SRRS and asks teachers to rate children, in comparison with other 
children, in terms of: (1) How Well they read. (2) How much they Like to read. (3) 
How Much they read. Scoring is similar to the SRRS. Table 3 presents the means 
and standard deviations for the SRRS and the TRRS by grade level and Well, 
Like, and Much scores. The ERAS correlated .33 with the SRRS Well score 
combined for the three grades, .56 with the SRRS Like score combined for the 
three grades, and .48 with the SRRS Much score combined for the three grades. 
Supportive validity coefficients should be statistically significant beyond the 
five percent level and of a reasonably high magnitude. There is 
debate on what is a “reasonably high magnitude” and recommendations in the 
literature range from .30 to .40 (Guilford, 1956). For our purposes, correlations of 
.35 and above were deemed acceptable. All the validity coefficients for the SRRS 
are positive, significant beyond .05 (p < .001), and relatively high in magnitude. 
Discriminant validity is also indicated as the items correlate in the expected 
direction; attitude correlates more highly with liking for reading and how much is read 
than it does with the achievement item asking how well one reads. The ERAS and 
SRRS validity correlations are similar in direction and magnitude to those found 
by Dulin and Chester (1974) for grade eleven (r = .36, .60, .54, respectively). Estes 
et al. (1975) also report a correlation of .28 between student self-estimate of 
attitude and the ERAS. 


TABLE 2 
ERAS INTERCORRELATIONS FOR GRADES 5, 6, AND 7 

[Table body illegible in the source.] 

*decimals omitted 


Correlations between the ERAS and the TRRS Well, Like, and Much scores, 
combined for the three grades, were .36, .39, and .37, respectively. Again, although 
of more modest magnitude, the validity correlations are positive, significant 
beyond .05 (p < .001), and in the expected direction. The ERAS and the TRRS 
correlations also corroborate the Dulin and Chester validity results in direction and 
magnitude (r = .33, .38, .40, respectively). Estes et al. (1975) did not report a 
correlation for teacher judgment and the ERAS. It is interesting to note that the 
TRRS teacher estimates correlate somewhat lower with the ERAS, indicating that 
children’s estimates of attitude toward reading may be more valid. Dulin and 
Chester (1974; Chester & Dulin, 1977) suggest that children’s responses may 
provide the most reliable criterion in attitude assessment. In validating an instrument 
using 36 photographs to assess attitudes toward reading for kindergarten, grade 
one, and grade two children, Redelheim (1976) also found that children’s scores on 
the attitude measure were more closely associated with their self-ratings of attitude 
than with ratings made by their teachers. 


TABLE 3 
SRRS AND TRRS MEANS AND STANDARD DEVIATIONS BY GRADE 

                          SRRS                                        TRRS (a) 
          Grade 5       Grade 6       Grade 7       Grade 5       Grade 6       Grade 7 
          mean   sd     mean   sd     mean   sd     mean   sd     mean   sd     mean   sd 
Well      [values illegible in the source] 
Like      [values illegible in the source] 
Much      [values illegible in the source] 
Total 
Scale     10.62  2.69   [illegible] 2.98   9.92  2.90   [illegible]   [illegible]   9.41  3.09 
N         (448)         (453)         (502)         (448)         (453)         (502) 

(a) Based on responses from 65 teachers 

Data from previous studies also indicate that achievement in a subject area will 
correlate positively with attitude measures in that subject. Thus, correlations 
between reading achievement and reading attitude measures provide another 
indication of convergent validity. The ERAS correlates .33 with the MAT 
Vocabulary scores and .37 with the MAT Comprehension scores. Both correlations, 
although modest, are positive and significant (p < .001) and the two correlations, 
as might be expected, are similar in magnitude. These validity correlations are 
somewhat lower than the correlation of .50 reported by Estes et al. (1975) for the 
ERAS and the STEP test using older subjects. This suggests that attitude may 
become more important in relation to school subjects as children grow older. It 
would be interesting to see if this relationship holds with younger and older 
subjects in other content areas as well. No data were reported in the Dulin and 
Chester validity study on the correlation between reading attitude and 
achievement. 


Conclusions 
There is a need for good instruments which are capable of providing accurate 
estimates of children’s attitudes toward reading. Once instruments are available, 
further research can be conducted to indicate how attitudes develop and how they 
can best be influenced in a positive direction. Given the state of the art in affective
assessment, chances for success are currently greatest if a combination of measures
is used, including both reactive and nonreactive types. Reactive measures are
those in which subjects are aware a measure is being taken and include 
questionnaires, interviews, repertory grid techniques, and inventories. The 
questionnaire or inventory is the most frequently reported reactive measure now 
being used in measuring reading attitude (Summers, 1977). Nonreactive measures 
are those in which subjects are not aware a measure is being taken and include 
observation checklists and unobtrusive archival records such as library circulation 
data and classroom indices. On the basis of the validity data from this study, it 
seems reasonable to conclude that the ERAS is useful in obtaining a global reactive 
assessment of school-oriented attitude toward reading in intermediate grades. The 
internal consistency and convergent validity findings also confirm results obtained 
in previous studies using older subjects. However, with a reported readability level 
of grade 6, users of the ERAS should be alert to the possibility that some poor 
readers may need help in responding to the scale. It should also be borne in mind 
that these conclusions are based on responses to a given instrument by the specific 
group of subjects surveyed in this study.


Predicting the Educational Bias Effect


Bias effect research has been conducted over the past 15 years, with many 
investigations relating to the educational system. The authors note that isolated 
investigations may be looked at together to build a logical argument that biasing 
conditions indigenous to the public school delivery system do influence teacher 
perceptions, thereby influencing actual child behaviour. Laboratory techniques for 
investigating the educational bias effect have progressed to the point where 
investigators may begin to study the variability in a group’s susceptibility to this effect.
This paper represents one such effort and documents a significant relationship
between teachers’ acceptance of diverse behaviour and their resistance to the biasing 
influence of a deviancy label. 


The term educational bias effect refers to the phenomenon whereby a teacher’s 
expectations for a child’s behaviour may actually become a determinant of the 
child’s behaviour. This area of research has its antecedents in the experimenter 
bias effect isolated by Rosenthal and Fode (1963). These researchers noted that 
experimenters with preconceived notions of the “proper” outcomes of an 
experiment inadvertently influenced human subjects to perform in ways consistent
with the experimenter’s bias. Although attempts to replicate Rosenthal and Fode’s
study have been only partly successful (Barber, Calverley, Forgione,
McPeake, Chaves, & Bowen, 1969), the term experimenter bias effect was coined
and researchers began to investigate the phenomenon’s influence in social situations
outside the research laboratory (e.g., school classrooms).


Palardy (1969) demonstrated that existing teacher expectancies were related to 
the academic performance of first grade children. His study differentiated between 
teachers who believed boys and girls learn to read with equal facility and teachers
who held to the notion that, relative to girls, first grade boys are handicapped in 
learning the basic reading skills. Palardy concluded: 


When first grade teachers reported that they believed that boys are far less successful 
than girls in learning to read, the boy pupils of these teachers did achieve less well on a 
standardized reading test than a comparable group of boy pupils whose teachers reported 
that they believed that boys are as successful as girls in learning to read. (p. 374) 


Researchers next investigated variables indigenous to the educational system 
which have the potential to influence a teacher’s expectations for a child’s success 
in school. Jones (1970) showed that perceived sociocultural level influences 
teachers’ expectations of a child’s self-perceptions. Cahen (1966) demonstrated 
that IQ and reading group assignment were sufficient as biasing factors to alter 
student teachers’ grading of test booklets. Ross (1974) found that physical 
attractiveness is a determinant of teacher expectations. When exposed to a 
standardized and nebulous psychological report, experienced classroom teachers 
more often felt attractive children belonged in regular education classes while 
unattractive children would best be served through special education resources. A 
variety of other factors have been shown to be influential in biasing teachers’ 
expectations for their students. Brophy and Good (1974) have provided an 
excellent review of this literature; they indicated that positive and negative “halo 
effects” operate in “teachers’ ratings of students just as in any interpersonal 
ratings” (p. 135). Within the area of special education, research has consistently 
demonstrated that the series of medical model deviancy labels utilized by the 
public schools generate lowered expectations for child performance. Foster and 
Ysseldyke (1976) investigated the relationships between teacher expectations and 
the labels of mentally retarded, emotionally disturbed and learning disabled. Using 
a laboratory technique, they showed that each of these labels lowered expectations
and that this halo effect was sufficient in strength to cause experienced teachers to 
misinterpret videotaped child performance. Further investigations have 
demonstrated that this phenomenon operated even when the teachers were 
cautioned to be objective in their ratings of the videotaped behaviour (Foster & 
Salvia, 1977). A variety of other studies have shown that teachers react negatively 
to special education labels as expectancy generating stimuli (Algozzine & Mercer, 
in press). 

Teachers’ relative acceptance of diverse behaviour has been the focus of recent 
research. Algozzine (1977) has shown that teachers find disturbing behaviours 
differentially bothersome while Herr, Algozzine and Eaves (1976) have shown 
reactions to disturbing behaviour may be modified with training. Algozzine, Mercer 
and Countermine (1977) demonstrated that acceptance of a child’s behaviour was a 
function of whether that behaviour was or was not expected based on the 
categorical label assigned to the child. Thus variance exists in the degree to which
teachers find certain behaviours disturbing, and the above investigations have
refined a technique to measure this variability.


This investigation represents research within a fourth generation of bias effect 
studies. The first generation of work isolated the effect in a non-educational 
setting, the second generation demonstrated that the effect influences actual child 
behaviour in an educational setting and the third generation has demonstrated 
that the educational bias effect may be elicited by a variety of variables present in 
our educational system. The fourth generation of research, as represented here, has 
investigated variability among teachers in susceptibility to the bias effect. It is
anticipated that such work will eventually lead to the prediction of the effect and 
suggestions to minimize the detrimental results that bias has for individual 
children. 


Method 
Subjects 


Subjects were 28 regular education classroom teachers, 25 female and 3 male. The
teachers were attending a three-day continuing education workshop offered through
a large southern university. The mean age of the subjects was 31.2 years and their
mean experience was 5.8 years. The subjects were about evenly distributed over the
public school grade range, with 29% teaching K-3, 25% teaching 4-6, 25% teaching
7-9, and 21% teaching grades 10-12. Nineteen subjects held Bachelor’s degrees
and nine held Master’s degrees.


Materials 


Disturbing Behavior Checklist (DBC). The Disturbing Behavior Checklist 
(DBC) is a factor analytically derived scale of behaviours which have been shown to 
be differentially bothersome (i.e., disturbing) to teachers. It contains 55 items, 
most of which can be arranged into four mutually exclusive factors. Sets of 
behaviours which are bothersome because they are immature, socially defiant, 
physically disturbing, and socially delinquent have been identified (Algozzine, 
1977). The validity of the scale has been evaluated using logical, correlational, and 
experimental procedures; results of these studies have been reported by Algozzine, 
Barnette, and Danielson (1977). The reliability of the scale and subscales (i.e., 
coefficient alpha) has been reported to range from .62 to .93 (Algozzine, 1977) and 
was considered adequate for use of the DBC in this research. 


In responding to the checklist, subjects are requested to rate the relative 
“disturbingness” of the 55 items of which it is comprised. The responses are placed 
on a 1-5 Likert-type scale with 1 = not very disturbing and 5 = very disturbing. 
Thus high scores are indicative of a respondent finding items (behaviours) to be 
relatively disturbing. Respondents are instructed to rate each item with regard to 
how disturbing it would be in working with children. 
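
The scoring just described can be illustrated with a minimal sketch. It assumes that each factor score is the mean of the ratings on that factor's items and that the overall score is the mean of all ratings. The item-to-factor key below is hypothetical (the published assignment of the 55 items is not reproduced in this article), and the toy respondent has only 11 items.

```python
from statistics import mean

# Hypothetical item-to-factor key, for illustration only. The real DBC has
# 55 items, and not every item belongs to one of the four factors.
FACTOR_ITEMS = {
    "social_immaturity": [0, 1, 2],
    "socially_defiant": [3, 4, 5],
    "physically_disturbing": [6, 7],
    "socially_delinquent": [8, 9],
}


def score_dbc(ratings):
    """Return the four factor means and the overall mean for one respondent.

    ratings: list of integer ratings, 1 = not very disturbing,
    5 = very disturbing, one entry per checklist item.
    """
    if any(r not in (1, 2, 3, 4, 5) for r in ratings):
        raise ValueError("each rating must be an integer from 1 to 5")
    scores = {
        name: mean(ratings[i] for i in items)
        for name, items in FACTOR_ITEMS.items()
    }
    scores["overall"] = mean(ratings)  # mean over every item, keyed or not
    return scores


# Toy respondent with 11 items; the last item belongs to no factor.
example = score_dbc([2, 3, 4, 4, 4, 4, 3, 5, 1, 3, 3])
```

Higher values on any of the five scores indicate that the respondent finds the corresponding behaviours more disturbing.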


Videotape. Bias was measured by teacher reactions to a 12-minute colour
videotape of a normal fourth grade male. The tape showed the child taking subtests
of the Wide Range Achievement Test and the Peabody Individual Achievement Test,
performing several psychomotor exercises, and engaging in a free play period. The
child was selected from a pool of over 400 children on the basis of normalcy in the
areas of intellectual, academic and social development, and physical appearance
(Foster, Ysseldyke & Reese, 1975).


F-Y Checklist. Reactions to the videotape were measured by responses on a
23-item referral instrument developed for use with this tape. Subjects were asked 
to rate the child’s behaviour in various areas including academic skills, 
perceptual/motor development, activity level and personal/social adjustment. 
Ratings were made along a 106 mm continuum divided into five levels from 
“superior” to “far below average.” Distance along this line constituted the score for 
each item, with the average distance for all items constituting the total score for 
the measure. Low scores are indicative of positive ratings and scores above the 
midpoint (i.e., 53) are considered below average expectations. The total score for 
the F-Y Checklist consists of the average for all items completed; this score was 
seen as an appropriate measure since previous research had indicated that 
subscores within the scale (i.e., for academics or activity level) were not 
significantly different from this total (Foster, 1976). 
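
As an illustration of this scoring rule, the following sketch averages the marked distances over the items a rater completed and compares the result with the 53 mm midpoint. The function names and the use of None for a skipped item are assumptions made for the example, not part of the instrument.

```python
from statistics import mean

LINE_LENGTH_MM = 106               # length of the rating continuum
MIDPOINT_MM = LINE_LENGTH_MM / 2   # 53 mm, the "average" point


def fy_total_score(distances_mm):
    """Average the marked distances over completed items.

    distances_mm: one entry per item; None marks an item left blank
    (an assumed convention for this sketch).
    """
    completed = [d for d in distances_mm if d is not None]
    if not completed:
        raise ValueError("no completed items")
    if any(not 0 <= d <= LINE_LENGTH_MM for d in completed):
        raise ValueError("distances must lie on the 0-106 mm line")
    return mean(completed)


def is_below_average(score):
    """Scores above the midpoint indicate below-average expectations."""
    return score > MIDPOINT_MM


# Toy rater who completed three of four items.
toy_score = fy_total_score([60, 70, None, 58])
```

Because low scores are the positive end of the scale, a total above 53 is read as a below-average rating of the child.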


Procedure 


Data for this investigation were collected in two separate settings. Subjects 
were asked to complete the DBC during the first session and to rate the videotaped 
behaviour during the second session held two days later. Separate researchers 
conducted the two sessions and the subjects were not told that the two data 
collection points were related. 


During the bias effect portion of the study, subjects were given the F-Y 
Checklist and told that this instrument would be used to provide responses to a 
videotaped child. The tape was accurately explained to the subjects except they 
were not told that the child was normal. Instead, the subjects were told that the
child had been diagnosed by a clinical team as being retarded; this label
constituted the biasing information given to the teachers. The tape was shown, the
F-Y Checklists were completed and collected, and the subjects were debriefed
concerning the true nature of the research.


Results 


The F-Y Checklist yielded one score for each teacher, representing an overall 
mean for the responses to the 23 items of the instrument. The Disturbing Behavior 
Checklist yielded five scores, corresponding to means for the four DBC factors and 
an overall mean. Group means and standard deviations for DBC and F-Y Checklist 
responses are presented in Table 1. The descriptive data suggest that the teachers 
rated the child’s behaviour as below average (i.e., above midpoint of 53) and found 
the behaviour of the DBC to be moderately disturbing. 

A set of six dependent t tests was utilized to determine if the observed
differences present in teachers’ ratings of the disturbingness of the DBC factors
were statistically significant. The results of these tests are presented in Table 2;
nonsignificant differences are underlined. The behaviours comprising the socially 
defiant factor of the DBC (i.e., destructiveness, disobedience, negativism) were 
seen as more bothersome than those of the other three factors. The results are 
consistent with those of other investigations (Algozzine, 1976, 1977). 
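
Four factors yield six possible pairwise comparisons, which is presumably why six tests were run. The sketch below shows one way such a set of dependent (paired) t tests could be computed; the factor scores are invented toy values, not the study's data, and the function names are illustrative.

```python
from itertools import combinations
from math import sqrt
from statistics import mean, stdev


def paired_t(x, y):
    """Dependent-samples t statistic and degrees of freedom."""
    if len(x) != len(y):
        raise ValueError("paired samples must have equal length")
    diffs = [a - b for a, b in zip(x, y)]
    n = len(diffs)
    # t = mean difference divided by its standard error
    return mean(diffs) / (stdev(diffs) / sqrt(n)), n - 1


def all_pairwise_t(factor_scores):
    """Run a paired t test for every pair of factors."""
    return {
        (a, b): paired_t(factor_scores[a], factor_scores[b])
        for a, b in combinations(factor_scores, 2)
    }


# Invented factor means for four teachers (toy data only).
toy_scores = {
    "social_immaturity": [3, 4, 3, 4],
    "socially_defiant": [4, 5, 5, 4],
    "physically_disturbing": [3, 3, 4, 5],
    "socially_delinquent": [2, 4, 3, 3],
}
results = all_pairwise_t(toy_scores)  # six (t, df) pairs
```

Each resulting t would be compared with the critical value for n - 1 degrees of freedom, which is 27 for the 28 teachers in this study.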

To determine the relationship between teachers’ ratings on the DBC and their
responses to the F-Y Checklist, correlation coefficients and proportion of variance
estimates were computed (Ferguson, 1971); the correlations are presented in Table
3. A significant relationship was found between the F-Y Checklist responses and
the overall DBC ratings, accounting for about 20% of the variance. Only the
socially immature subscale of the DBC yielded a significant relationship with the
F-Y Checklist, accounting for about 14% of the variance.


TABLE 1

MEANS AND STANDARD DEVIATIONS FOR SUBJECTS’ RESPONSES
TO DBC AND F-Y QUESTIONNAIRE

                   Overall       Social        Socially   Physically    Socially
        F-Y        DBC Score     Immaturity    Defiant    Disturbing    Delinquent
Mean    62.79      [illegible]   [illegible]   3.9        [illegible]   [illegible]
SD      7.93       0.4           0.4           0.6        0.9           0.7


TABLE 2

RESULTS OF T TESTS COMPARING RESPONSES TO FOUR DBC FACTORS

Socially      Physically    Socially      Social
Defiant       Disturbing    Delinquent    Immaturity
[illegible]   [illegible]   [illegible]   [illegible]


TABLE 3

CORRELATION COEFFICIENTS BETWEEN F-Y QUESTIONNAIRE AND DBC FACTORS

                 Overall       Social        Socially   Physically    Socially
                 DBC Score     Immaturity    Defiant    Disturbing    Delinquent
F-Y Checklist    [illegible]   .38*          .18        .13           .30

* [significance level illegible in source]
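
The proportion of variance figures follow from squaring the correlation coefficient: a correlation of .38 gives .38 squared, or about 14%. The sketch below shows this arithmetic together with the conventional t test for a correlation. The article does not state which significance test the authors used, so the test and the critical value of about 2.056 (two-tailed, .05 level, 26 degrees of freedom, taken from a standard t table) are assumptions made for illustration.

```python
from math import sqrt


def pearson_r(x, y):
    """Pearson product-moment correlation of two equal-length lists."""
    n = len(x)
    mx, my = sum(x) / n, sum(y) / n
    sxy = sum((a - mx) * (b - my) for a, b in zip(x, y))
    sxx = sum((a - mx) ** 2 for a in x)
    syy = sum((b - my) ** 2 for b in y)
    return sxy / sqrt(sxx * syy)


def variance_explained(r):
    """Proportion of variance shared by two measures: r squared."""
    return r * r


def t_for_r(r, n):
    """t statistic for testing r against zero, with n - 2 degrees of freedom."""
    return r * sqrt(n - 2) / sqrt(1 - r * r)


N_TEACHERS = 28
CRITICAL_T_26DF = 2.056  # assumed criterion: two-tailed, .05 level

r_immaturity = 0.38
share = variance_explained(r_immaturity)   # about 0.14
t_value = t_for_r(r_immaturity, N_TEACHERS)
significant = t_value > CRITICAL_T_26DF
```

Under these assumptions a correlation of .38 with 28 subjects falls just beyond the .05 criterion, which is consistent with social immaturity being reported as the only significant single factor.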


Discussion 


The results of this administration of the F-Y Checklist are consistent with previous
data collected under similar conditions. Previous research (Foster &
Ysseldyke, 1976; Foster, Ysseldyke & Reese, 1975; Foster & Salvia, 1977) has 
demonstrated that F-Y Checklist ratings, when beyond the midpoint rating of 53, 
are indicative of the bias generated by the described research technique and, in 
fact, constitute a halo effect measurement. The purpose of this investigation was to 
find correlates to the bias effect, not to duplicate previous research by 
demonstrating that the technique results in establishment of a bias. Readers 
interested in the establishment and measurement of the bias effect are referred to 
these previous studies. 


Teachers’ responses to the socially defiant factor of the DBC were significantly 
different from their responses to any of the other factors. These results are similar
to those of other investigations (Algozzine, 1976; Curran, 1977; Mooney &
Algozzine, 1978) and lend support to the validity of this administration of the
instrument.


The significant correlation between bias effect ratings (F-Y Checklist) and 
DBC ratings suggests that those teachers who found the DBC behaviours to be less 
disturbing were also less susceptible to the bias effect. The DBC factor of social 
immaturity was the best single factor predictor of checklist scores and was the only
factor to reach significance as an isolated predictor. This makes intuitive sense, as
it is logical that the stereotype for retarded children would contain elements
relating to immaturity. Teachers who found immature behaviour more disturbing
were also more susceptible to the biasing influence of the “mentally retarded” label.


Research links exist which will allow us to logically deduce that a teacher’s 
susceptibility to bias will influence the academic performance of certain children. 
This investigation has established that a relationship exists between a teacher’s 
reported acceptance of diverse behaviour and his/her susceptibility to the bias effect.
It seems to follow that the degree of acceptance may also relate to academic
performance of at least some children.

This research has also demonstrated that a laboratory technique may 
successfully be used to investigate variance in susceptibility to the educational bias 
effect. It is felt that this report of the relationship between susceptibility to bias 
and non-acceptance of behaviour should serve as an impetus for further research, 
rather than as the basis for decisions within the field. 

Anticipating the day when the bias effect is openly admitted to operate in ways 
harmful to children and the relationship between bias and non-acceptance is 
verified, we offer the following suggestions to reduce the negative effects: 

1. School system personnel may wish to reduce the number of negatively biasing 
variables indigenous to the classroom setting (e.g., deviancy labels, rampant 
use of mass administered intelligence and achievement tests). 

2. We may wish to re-examine our present emphasis on homogeneous grouping 
patterns. Smaller class sizes with expanded ability ranges would allow for more 
individualization of instruction and less artificiality in our expectations of 
what children should or should not be like. 

3. Preservice and inservice training in broadened acceptance of behaviour could 
be instituted within universities and school systems. Ongoing emphasis toward 
acceptance would be more effective than any short term efforts to alter 
teachers’ expectations. 

4. School system personnel may wish to make use of these findings in their 
selection process for new teachers. A teacher’s tolerance toward diverse behaviour
could be assessed and could be considered along with GPA, GRE scores,
and other personal information.

5. Sophisticated matching of teacher and pupil may one day become an empirical 
and administrative possibility. The data presented here would indicate that 
teachers who find many behaviours to be disturbing may be less effective in 
working with certain children as they will be more negatively influenced by the 
biasing labels these children carry.


PERSPECTIVES


Research Traditions in the History of Education: 
A Study of Goals and Their Emergence 


Henry W. Hodysh 
The University of Alberta 


Eighteen years have elapsed since Thomas Kuhn first published his study of 
the paradigm and its function in the history of science (1970a, first published 
1962). Although Kuhn’s interpretation of the paradigm as an organizing principle 
of research has stimulated extensive discussion in many fields of inquiry (Kuhn, 
1970b, 1970c), it has generated little activity among historians of education¹ who, it
would seem, have remained somewhat distant from his concerns. A number of 
reasons might account for this inattention, though perhaps it would not be 
inappropriate to speculate that many historians of education have viewed Kuhn’s 
methodological interests as subordinate to the more established research problems 
in the field. 

Some progress in correcting this oversight might be achieved by exploring the 
implications of Kuhn’s thesis for the way in which educational history has emerged 
as a field of study in the European setting. In determining these implications, the 
effect of social and educational forces external as well as internal to the community 
of historians itself might be taken into account (Kuhn, 1970a, p. x). Such an 
approach would be of assistance in complementing the genetic or linear perspective 
so frequently adopted towards the study of the past (Hodysh, 1977). Attention 
might be directed to the concept of the paradigm as a first step in this endeavour. 


For Kuhn, the operation of the paradigm as an organizing principle in the 
development of knowledge is exemplified in the following observation: 


At the start a new candidate for paradigm may have few supporters, and on occasions the
supporters’ motives may be suspect. Nevertheless, if they are competent, they will
improve it, explore its possibilities, and show what it would be like to belong to the 
community guided by it. And as that goes on, if the paradigm is one destined to win its 
fight, the numbers and strength of the persuasive arguments in its favor will increase. 
More scientists will then be converted, and the exploration of the new paradigm will go on. 
(1970a, p. 159) 


This assertion entails two interdependent meanings of paradigm, both of which are 
integral to Kuhn’s position. On the one hand, “paradigm” represents the 
“disciplinary matrix” or “the entire constellation of beliefs, values, techniques, and 
so on shared by members of a given community.” On the other, it “denotes one sort 
of element in that constellation, the concrete puzzle-solutions which, employed as 
models or exemplars, can replace explicit rules” as a basis for solving the puzzles of 
normal science (1970a, p. 175). The paradigm as puzzle-solution, therefore, is 
contained within and dependent upon the interaction of elements in the 
disciplinary matrix. In sum, the paradigm as disciplinary matrix plays a 
foundational role in the development of knowledge in the scholarly community. It 
is the ground or basis of the essential characteristics and components which define 
the community’s research traditions (Kuhn, 1977, p. 297). 


If the paradigm can be employed as a basis for the study of research traditions 
in the scientific community, what, then, are its implications for the analysis of 
research traditions in the history of education? Insight into this question might be 
gained by focusing more closely on the way in which historians of education 
perceive the development of goals in their field of study. According to Kuhn, 
members of a professional community see themselves “and are seen by others as 
the men uniquely responsible for the pursuit of shared goals” (1970a, p. 177). It is a
knowledge of these goals as shared commitments that becomes critical to our
understanding of the disciplinary matrix, for such knowledge helps to establish an
essential component of the community’s research traditions. 


The problem might be more clearly defined in relation to the following question:
In what way are the goals and commitments of educational historians
reflected in their historical research? The question is critical to our study for it 
directs us to the concerns of the historian and, as such, serves as an end or focal 
point for the research activities of the community. Its significance in the study of 
research traditions might be clarified through an examination of the community’s 
professional literature (Kuhn, 1977, pp. 35ff) — a task which recognizes that a field 
of study is defined not only by elements integral to its disciplinary matrix but, as in 
the case of intellectual history, by forces extant and operative in the broader 
cultural-historical milieu (Kuhn, 1977, p. xv). 


By way of illustration, then, this study explores developments in the European 
context with particular reference to Polish educational historiography during the 
first quarter of the twentieth century.² The setting is of special interest, for it was
during that time — after some 125 years of partition and occupation by Austria, 
Prussia, and Russia — that Poland achieved independence and embarked upon a 
program of intellectual development which extended into many areas of life 
(Nauka Polska, Vol. I, 1918; Nawroczynski, 1938, pp. 173-193), including the study 
of history and history of education (Morawski, 1922, p. 14; Konopczynski, 1925, p. 
4). In many ways it was a period of intellectual renewal (Perkowska, 1975) in which 
historians refurbished their links with a rich and colourful past and, at the same 
time, looked ahead to the future (Dyboski, 1933-34, p. 318). Moreover, it was a time
when the history of education contributed in a special way to the nation’s cultural
identity (Barycz, 1949, 1967; Hulewicz, 1974). 


The study, citing selected developments in Polish educational historiography, 
focuses specific attention on the professional goals and commitments of the 
community of historians as factors central to the emergence of its research 
traditions. It endeavours to determine the nature of these goals through a descriptive
analysis of the community’s professional literature. Within this general
framework, both the possibilities and limitations of Kuhn’s position are considered 
without necessarily accepting its particulars. 


The Emergence of Educational History 


The development of Polish historiography during the years preceding 
independence in 1918 is of particular interest to the growth of educational history 
as a field of study. Although the partitioning powers tended to neglect the study of 
Polish history, using it as a means to advance their own interests (Karbowiak, 1900, 
p. 8), Polish historians themselves continued their historical research and 
contributed a significant body of literature to the nature and purpose of the field 
(Bujak, 1930; Dyboski, 1933-34). The efforts of historians to describe and explain 
the development of the nation’s past resulted in the formation of the Cracow and 
Warsaw Schools of Historical Thought (Dutkiewicz & Sreniowska, 1959). Whereas 
the Cracow School perceived the setbacks and defeats that led to partition to be 
the result of internal problems, the Warsaw School attributed the nation’s 
misfortunes to external developments and political events beyond her control (Kot, 
1941, p. 32; Serejski & Grabski, 1974, pp. 5-34). These differing perspectives 
influenced much of the discussion regarding the function of history during the 
latter part of the nineteenth and early part of the twentieth century. 


Within this setting, the history of education was studied as part of the nation’s 
cultural history. As a special interest, it was represented in the writings of Szujski 
(1878), Windakiewicz (1890), Morawski (1900), and others (Hulewicz, 1960) and, in 
terms of its formative development, could point to a tradition that reached as far 
back as the eighteenth century (Lempicki, 1925, pp. 1-2). 

By the early 1900s, scholars at the Jagiellonian University in Cracow and the
University of Lwow suggested that the history of education as a study required 
greater attention and development. At the Third Congress of Polish Historians, for 
example, Karbowiak advocated a reconstruction of archival and monographic 
materials in the field, proposing that a closer look at Polish educational history 
would create opportunities not only for an examination of problems particular to 
education but for awakening the younger generation to the “spirit of the nation” 
(1900, p. 7). Karbowiak argued that these objectives might best be achieved by 
establishing academic chairs in Pedagogy and the History of Education at Polish 
universities. His suggestion was echoed later by Kukulski who called for a program 
in the History of National Education at the Jagiellonian in Cracow. Alluding 
briefly to the historical debates of the nineteenth century, Kukulski also suggested 
the need for new directions in the interpretation of Poland’s educational past 
(1914). It would seem, therefore, that an intellectual climate conducive to 
educational history as a specialization within the broader field of cultural history 
was already in the making before the outbreak of World War One. 


The achievement of Polish independence following the war opened a new 
chapter in the history of education as a field of study. In Cracow on November 5,
1919, the Minister of Religious Denominations and Public Education, Lukasiewicz,
charged a commission under the direction of Morawski, President of the Polish
Academy of Sciences and Letters, “to fill a great vacuum in the area of pedagogical
thought and practice.” 


In relation to this aim, the Ministry has called to life a Commission of History and 
Pedagogy to research the history of education and schools in Poland. The Commission has 
prepared a very broad plan of educational activity and has hopes of a series of studies and 
monographs in the history of education and pedagogical thought. (column CL/42) 


During its first years of operation, the Commission gained vigorous support from 
historians and educators. In a newsletter from the Lwow section of the 
Commission, Lempicki commented upon the importance of its activities to the 
nation. 


... with the birth of our homeland . . . it is necessary to build a deep feeling of nationhood
and its system of education: we must look to our educational past which will serve as a 
bond to the present system of education . . . (1922, p. 2) 


The Commission contributed to a renewed interest in the history of pedagogical 
thought (Barycz, 1949, pp. 30-37) and led to the publication of a variety of 
historical investigations, including among them studies of Konarsky, Piramowicz, 
Kollataj, Sniadecki, the history of parish schools, and the famous Commission of 
National Education of 1773 (Bobkowska, 1937, p. 143; Ministerstwa . 
Publicznego, 1921, pp. 167ff), Europe’s first ministry of education (Dyboski, 1933, 
p. 270). 


Organization and Development 


A key figure in the organization and development of research in educational 
history during the first decade of independence was Professor Kot from the 
Department of Cultural History at the University of Cracow. In 1913, Kot had 
visited archives throughout Europe in order to determine the place of Polish 
scholarship in the context of the European historical community (Sprawozdanie... 
Akademi Umiejetnosci, 1914, p. 6). Later, in his role as Secretary to the 
Pedagogical Commission, he coordinated a cadre of experts from Cracow, Lwow, 
Warsaw, Poznan, and other centres in organizing and developing archival 
materials. As author and editor of many scholarly publications, Kot turned his 
attention to a number of areas for research, most notably the history of education, 
the Reformation of the sixteenth and seventeenth centuries in Poland, and the 
relation of Polish thought to the wider European historical scene (Lempicki & 
Hartleb, 1937, pp. 11-12). His participation in learned societies provided visibility 
to the history of education and placed it among the developing interests of 
historical and educational scholarship in Poland (Tazbir, 1976, pp. 752-755). 
Bruckner, a prominent historian of the period, noted Kot’s interest in the history 
of education as a stroke of good fortune for a “much neglected” field of inquiry 
(1910). 

Among Kot’s studies of the past was the Historja wychowania (The History of 
Education), a two-volume cultural history of pedagogical thought and practice that 
placed the development of Polish education in the mainstream of European 
thought (1924). The study, though directed to the interests of the Polish historical 
community, was international in scope. It acknowledged not only the role of recent 
Polish scholarship in the history of education but also the contribution of such 
writers as Barth, Compayre, and Paulsen (Kot, 1924, pp. 20-21). 


Kot perceived the history of education to be part of the history of culture. As a 
field of inquiry, it was expected to meet the rigorous standards applicable to all 
historical scholarship and, as such, to be studied objectively and “founded upon the 
fullest of documentary evidence” (1924, p. 11). Utilizing a perspective that 
encompassed both institutional and noninstitutional education as topics for 
investigation, Kot introduced references to economic, artistic and other cultural 
factors as important in establishing the dominant “pedagogical ideal” in any given 
period of time (Kot, 1924, p. 7). The Historja begins with a description and 
explanation of education in classical Greece and Rome. This is followed by educational 
developments as they occurred in the Medieval, Renaissance, and eventually, the 
Modern periods of history. In the Polish setting, attention is directed to 
Humanism, the educational reforms of Konarsky, the Commission of National 
Education of 1773, the Reformation and Protestant Schools, the Jesuits, education 
under three-power partition, and, finally, a brief commentary on educational 
developments in the post World War One period. 


If the importance of an individual’s contribution to his field is gauged in part by 
the assessments of his colleagues, then Kot’s leadership in the development of 
Polish educational historiography deserves particular recognition. For example, a 
review of the Zrodla do historja wychowania (Sources to the History of 
Education) — a compilation and study of original source materials highlighting the 
development of pedagogical theory and practice — awarded high marks for his 
breadth and originality of scholarship (Bobkowska, 1930). It appears that Kot’s 
efforts in the study and organization of educational history raised the field to a 
position of prominence, affecting in a significant way the historical research of his 
colleagues and students (Barycz, 1967, pp. 202ff.). 


In sum, a number of events critical to the formation of goals held by members 
of the community of historians occurred in the first quarter of the twentieth 
century. The community revitalized the study of Poland’s educational past and, in 
the process, added a dimension to cultural history, the impact of which extended 
beyond the disciplinary matrix to the entire fabric of Polish intellectual life. 


Discussion 


It would be quite inappropriate to generalize on the basis of an exploratory 
study. Nevertheless, our inquiry lends itself to making a number of observations 
concerning both the possibilities and limitations of Kuhn’s notion of the paradigm 
as disciplinary matrix — observations which centre on three categories or types of 
issues pertaining to the emergence and perception of goals in the field of 
educational history. The first category points to the influence of the 
socio-historical milieu on the function of educational history; the second relates to 
the nature of historical inquiry qua historical inquiry; and the third reflects some 
of the problems or limitations connected to Kuhn’s notion of the paradigm that 
bear tangentially on the Polish data but which, nonetheless, are significant to 
further research. 

The study suggests that the perception of goals held by members of the 
historical community may be shaped by developments immediate not only to the 
community itself but, as indicated by Kuhn, to a wider range of factors in society in 
general. The achievement of Polish independence in 1918 appears to have been one 
such development. Its impact was made evident in a variety of contexts, including 
the creation of the Pedagogical Commission of 1919. To cite but one example, 
Lempicki’s comment on the function and activity of the Commission reflects an 
interest in educational as well as historical questions confronting the newly created 
state. Such questions, therefore, were not the private preserve of historians, as they 
were discussed frequently by the scholarly community as a whole. Moreover, this 
does not imply that all historians of education were equally influenced by events in 
society or that they tended to interpret or view those events from an identical 
perspective. Rather, it is suggested that the professional activities of the historical 
community, including the selection of topics considered to be worthy of 
investigation, were affected to some extent by concerns common to all. However, 
the precise nature of this interaction between the community of historians and 
society as a whole is difficult to determine. It is a problem which in its own right 
necessitates further investigation. 


In relation to the second issue, attention was directed to the nature of goals and 
commitments in terms of both the data of educational history and the framework 
in which the data were enclosed. For example, it would appear that Kot’s selection 
and interpretation of historical events were accompanied by an awareness of 
conceptual questions addressed by historians of education. This awareness, as 
evidenced by Kot’s reference to the nature of history of education and the 
contribution of his colleagues and predecessors to both the theory and practice of 
the study, served as a point of departure in the Historja for advancing two general 
kinds of historical inquiry. The first was a recognition of specialization and a 
concern for developing an in-depth body of knowledge based on monograph type 
studies in order to keep abreast of the ever-increasing quantity of historical data 
and research. The second was a recognition of the need to provide a synthesis of 
Polish history that would serve not only as an overview of the educational past but 
as a means of identifying and clarifying the course of the history of a nation whose 
continuity had been broken for more than a century. Kot’s concern with both areas of 
inquiry in the history of education placed him in the vanguard of historical 
research in the field as it developed in the early decades of the century. 


In the Polish setting, therefore, the history of education as a field of study was 
influenced not only by the interaction of elements within the disciplinary matrix 
itself, but also by events and conditions in the wider society, not the least of which 
were the expectation and achievement of independence in 1918. On the one hand, 
the coming of independence challenged the historical community to study more 
closely the social, educational, and other issues confronting the newborn state, and, 
on the other, it encouraged the community to develop anew the tradition of 
research which was to extend beyond the historical questions of the nineteenth and 
into the years of the twentieth century. 


It must be noted that our study of Kuhn’s paradigm is illustrative only and that 
many historical developments have not been considered. How, for example, might 
Kuhn’s analysis be employed to account for the apparent emergence of conflicting 
goals and commitments within the community? Additional study of what might be 
identified as a broader range of events, along with an analysis of the ideas of other 
historians and educators who influenced Polish educational historiography, would 
add to our understanding of research traditions. 


In relation to the third category, the matter of limitations, it should be 
reiterated that our inquiry has merely endeavoured to identify methodological 
possibilities and to explore a tentative rationale which historians of education have 
tended to overlook. The study has not defended a methodological alternative and, 
therefore, would be remiss if, however briefly, it did not take into account apparent 
limitations which the analysis has suggested. 


The more obvious of these limitations pertain to the description of goals and 
their representation in the professional literature of the historical community. For 
example, the evidence by which goals and commitments are determined may not be 
readily apparent in an analysis of the professional literature of the community, 
necessitating a closer look at a broader spectrum of community involvements. 
Moreover, it might be necessary to explore community activities and societal 
conditions which are peripheral to the community itself, but which through their 
indirect influence bear upon the emergence and formulation of research traditions. 
A study of the informal literature as well as an analysis of the role and influence of 
other scholarly communities may hold insights into the nature and function of the 
historical community and its research traditions. 


Beyond these rather specific problems are others no less significant, but more 
general in nature. For example, do educational historians comprise an extended 
group, numbering among its constituents not only academic historians but scholars 
from the field of education? Put differently, what criteria might be employed to 
define membership in the community of educational historians? Further, in what 
way do programs of teacher training and education affect the goals of the 
community and the determination of its research traditions? 


Implications 


Interestingly, the implications of Kuhn’s position for the study of educational 
history are both particular and general. On the one hand, Kuhn’s argument 
provides some direction to historians and, quite possibly, to others in related fields 
by suggesting specific focal points for investigation and further inquiry; on the 
other, it brings to attention the possibility of integrating some of Kuhn’s ideas on 
the emergence of research traditions with existing methodological frameworks, a 
possibility which would seem to reflect a certain breadth and flexibility inherent in 
much of Kuhn’s position. 

The inquiry into the paradigm and its possibilities and limitations has been 
heuristic in purpose. No attempt has been made to draw definitive conclusions. 
Moreover, there are many aspects of Kuhn’s position that have not been examined, 
such as the notion of the preliminary or pre-paradigm stage of a discipline’s 
development. Shortcomings notwithstanding, the study has raised tentative though 
promising hypotheses — hypotheses which upon clarification and development 
might lead not only to a more critical analysis of Kuhn’s argument but, perhaps, to 
a better understanding of educational history, its goals and research traditions. 


Notes 
1. The terms “history of education” and “educational history” are used interchangeably in this paper. 


2. Diacritical marks found in Polish will be omitted due to complications in printing. 


The writer gratefully acknowledges The Canada Council for providing a Research Grant used in the preparation of this 
study. 
The Pedagogical Institute, University of Warsaw; The Pedagogical Institute, Jagiellonian University of Cracow; and The 
Polish National Library in Warsaw for their assistance in obtaining Polish historical materials. 
TEACHING EDUCATIONAL STATISTICS 
IN THE COMPUTER AGE 


Ross E. Traub 
The Ontario Institute for Studies in Education 


STATISTICAL METHODS IN EDUCATION AND PSYCHOLOGY. By A. K. 
Kurtz and S. T. Mayo. New York: Springer-Verlag, 1979, 538 pp. (ISBN 
0-387-90265-1) 


During the course of my lifetime, the computational power available to 
members of the general public, but especially to university students enrolled in 
statistics courses, has increased enormously. The only computational device 
available to me from 1943 to 1955, the years I attended school in a small Alberta town, 
was my father’s analog computer, his slide-rule. It is true that in 1946, J. Presper 
Eckert and John W. Mauchly succeeded in building the first digital computer. But 
this invention did not affect Alberta students directly for more than ten years. As 
recently as 1958 when I enrolled in the introductory statistics course offered by 
Professor E. S. Keeping, all we had in the stat-lab were hand-cranked Munroe 
calculators. The computing resources then available to Alberta’s educational 
researchers were not much better. During the summer of 1959, when I worked as a 
research assistant for Professors G. M. Dunlop and Donald Black, the principal 
computing aids to be found in the Faculty of Education were several electrically 
driven mechanical calculators. It wasn’t until 1960, when I returned from a year of 
teaching school to work on a Master of Education degree, that I encountered the 
University of Alberta’s first digital computer. Professor R. S. MacArthur had the 
prescience to insist I take a noncredit course in how to program it. Although the 
name of this computer has passed from memory, I do recall that it was difficult to 
program, primarily because instructions were fed into it by means of a continuous 
paper tape which had to be made with painstaking care. In view of developments 
since 1960, however, that noncredit course was one of the most profitable 
investments of time I ever made. Since 1960 the power of digital computers has 
increased almost exponentially, year by year. So too has their accessibility. In 
addition, the development of the LSI chip has meant that for a very few dollars anyone 
can acquire an electronic calculator small enough to be held in the hand, yet able to 
make the same computations faster and more quietly than the mechanical devices 
in use 20 years ago. 


There is a point to be found in this bit of personal history. The world of 
computing has changed dramatically over the past twenty years. Instructors of 
statistics and the textbooks they use should be keeping pace; they should be 
making full and good pedagogical use of the computational power now available. 
Short-cut methods of “hand calculation” should interest historians of science, and 
no one else. Methods for grouping data to facilitate computation, Sheppard’s 
correction, which is needed to reduce the error in the standard deviation that is caused 
by grouping, and eight ways to obtain a square-root without the aid of a calculator 
are prime candidates for a museum display case. I can think of no good reason for 
consigning today’s student to an academic purgatory from where safe passage is 
earned by making seemingly endless calculations, all to obtain only one or two 
statistics. Granted, there may be instructional value in a limited amount of “hand 
calculation,” where the dimensions of the problem are the minimum required to 
reveal or to reinforce some concept. Beyond this point, however, students should be 
directed to the nearest available computer terminal, there to draw from storage 
data sets of realistic proportions and to investigate the characteristics of these sets 
by having the computer generate histograms, scatterplots, and all manner of 
descriptive and inferential statistics. To me, this approach to the teaching of 
statistics offers considerable promise: Understanding of concepts can be tested and 
modified through studies of varied and meaningful data sets; expertise can be 
acquired in handling data sets of realistic size; and, perhaps most important of all, 
the habit can be fostered of examining the match between assumption and data, of 
considering alternative statistics, and of trying different analytical models. (A good 
attempt in this direction, albeit one that is not directed specifically at budding 
educational researchers, is the MINITAB Student Handbook by Ryan, Joiner and 
Ryan, 1976.) 


This sunny morning of the computer age is associated with the development of 
another significant emphasis in statistics, that of exploratory data analysis. The 
alternative and traditional emphasis is confirmatory analysis. In performing 
analyses of the latter type, the statistician uses data to test the validity of a 
prediction or hypothesis, something that was, or should have been, formulated 
through application of hypothetico-deductive logic well before the data were 
collected. In confirmatory analyses, statistics is used to test the match between 
empirical reality and substantive (as opposed to statistical) theory. On the other 
hand, the thrust of exploratory data analysis is very different. The operative 
metaphor is the process of detection as employed by a master sleuth. In 
exploratory analysis, an extant set of data is deemed interesting and worthy of 
study. It is examined closely in order to ascertain its salient features. Unsuspected 
characteristics are noted. Clues are sought on which to base hypotheses or 
explanations. Ways by which to test hypotheses and explanations within the limits 
of the data are sought or invented. The statistician’s objective is to provide a 
coherent description and interpretation of the data, not necessarily the only 
possible or even the best description and interpretation, just one that is internally 
consistent and plausible in the same way that a detective’s description and 
interpretation of the data associated with a crime are internally consistent and 
plausible. 

One feature of exploratory methods bears special mention. This is a concern the 
exploring statistician has for the effect of bad data. Exploratory analysts try to use 
methods that are robust, which is to say, they try to describe and interpret data 
using results that are insensitive to “outliers,” i.e., bad data. Inasmuch as 
educational data in my experience are fraught with outliers, this emphasis is a 
welcome one indeed. (The testaments of exploratory data analysis have been 
written by John Tukey and Frederick Mosteller; cf. Tukey, 1977, and Mosteller & 
Tukey, 1977. An introductory text that incorporates the ideas of exploratory data 
analysis has been prepared by Canadian sociologists B. H. Erickson and T. A. 
Nosanchuk, 1977.) 
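
The point about robustness can be made concrete with a small sketch. The scores below are hypothetical and are not drawn from any study cited here, and the median is used only as one familiar example of a summary that resists outliers; it is not offered as the method of any of the authors named above.

```python
from statistics import mean, median

# Hypothetical test scores (illustration only, not data from any cited study).
scores = [52, 55, 57, 58, 60, 61, 63]

# The same scores with one recording error: 61 entered as 610.
scores_with_outlier = [52, 55, 57, 58, 60, 610, 63]


def summarize(values):
    """Return (mean, median) for a list of numbers."""
    return mean(values), median(values)


clean_mean, clean_median = summarize(scores)
bad_mean, bad_median = summarize(scores_with_outlier)

# One bad value drags the mean far from the bulk of the scores,
# while the median of this particular set does not change at all.
print(f"clean:   mean={clean_mean:.1f}  median={clean_median}")
print(f"outlier: mean={bad_mean:.1f}  median={bad_median}")
```

A single miskeyed value is enough to make the mean unrepresentative of every student in the set, which is the kind of damage a resistant statistic is meant to limit.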


Computational power is a great asset to the exploratory data analyst (although, 
as Tukey has shown, much can be done by hand). Computers facilitate the 
statistician’s study of a data set. With a computer, a terminal, and a set of 
analytical routines that can be called upon in the interactive mode, it is easy for the 
statistician to explore quickly the characteristics of his data, and just as quickly to 
test out hypotheses, examine the effect of transformations, etc. 


All of which brings us to the raison d’être for this review, the introductory 
statistics text by Albert K. Kurtz and Samuel T. Mayo. Briefly, this book is an 
anachronism. Kurtz and Mayo pay very little attention to the digital computer in 
their writing and they provide no basis at all for using it as a pedagogical aid. 
Moreover, the concept of exploratory data analysis and the topic of robust 
statistics are nowhere covered. 


Having said this, I must admit that the book has some excellent features, and 
were this 1950 I’d probably recommend it highly. Its strengths are these: good 
coverage of classical topics for a half-term (one-semester) course; plentiful and 
varied examples; a healthy respect for the standard error of a statistic (they 
report standard errors for all but one of the statistics they present); a breezy style of 
writing, well punctuated with humour; and the inclusion of most possible 
interpretations of the statistics they present, some that are based on years of 
experience with educational and psychological data. 


A chapter by chapter summary of the book’s contents is as follows: Chapter 1 
offers a rationale for statistics. It also contains a useful discussion of significant 
digits. Chapters 2 and 3 are devoted respectively to the topics of central tendency 
and variability. Not unexpectedly, in view of the lack of concern Kurtz and Mayo 
display for outliers, the mean and standard deviation are given favoured status. 
The fourth chapter is focussed on the normal curve. The relationship between this 
curve and the binomial frequency distribution is given appropriate emphasis. 
Statistical inference is the topic of Chapter 5. Many of the familiar concepts of 
confirmatory data analysis are discussed in this chapter — sampling distributions, 
the standard error of the mean, the t test, degrees of freedom, errors of inference 
(Types I and II), one- and two-tailed tests. A surprising omission, however, is the 
concept of statistical power. Chapter 6 is focussed on percentiles and their 
calculation, Chapter 7 on the shape of a distribution, its skewness and kurtosis. 
Correlation and regression take centre stage in Chapters 8, 9 and 10. The 
presentation on Pearson’s r includes illustrations of scattergrams for selected 
correlation coefficients and many examples of correlations from published studies. 
These examples suggest realistic expectations for the sign and magnitude of r for 
the kind of variables frequently encountered in educational and psychological 
research. This basis is augmented by an explicit set of statements from Kurtz and 
Mayo to be used in evaluating the r obtained in a study. Chapter 11 presents the 
chi-square statistic and illustrates its application to problems of contingency and 
to testing the fit of a theoretical distribution function to an observed frequency 
function. Several other nonparametric statistics are described in Chapter 12 — the 
sign test, the runs test, the median test, and the Mann-Whitney U test. This is 
followed by a chapter on one-way analysis of variance. In Chapter 14, a general 
framework is developed for testing differences between pairs of statistics. This 
framework is subsequently applied to means and standard deviations. The 
fifteenth and final chapter is entitled Reorientation. One purpose of this chapter is 
to confer “most important status” on six concepts: mean, standard deviation, 
Pearson’s r, analysis of variance, statistical inference and normal distribution. 
Chapter 15 also briefs the reader about advanced courses and suggests uses for the 
statistics taught in this book. 


In addition to the foregoing chapters, Kurtz and Mayo have provided 22 tables 
in an appendix. These include the usual entries for the normal, t, F, and chi-square 
distributions. Also, there are tables for use in testing the statistical significance of 
Pearson’s r, point-biserial r, tetrachoric r, and the phi-coefficient. Consistent with 
their philosophy of ignoring the computational devices now available to students, 
Kurtz and Mayo have also tabled values of the function N(N-1) for the integers 
from 1 to 499, the products of all pairs of the integers from 1 to 34 with the integers 
from 1 to 49, and the square of N and the square roots of N and 10N for the 
integers from 1 to 999. 


To conclude: If you and your students have access to a digital computer and if 
you, the instructor, wish to develop a respect for robust methods and interest and 
skill in exploratory data analysis, then this is not the statistics textbook you need. 
The only reason I can suggest for buying a copy is to reflect upon the strong 
opinions and recommendations of two experienced instructors. 
RELIABILITY OF THE ALBERTA ESSAY SCALES 


The technique of grading essays through comparisons with a set of models has 
enjoyed at least intermittent popularity for several decades. The earliest scales 
(Hillegas, 1912; Willing, 1922; Van Wagenen, 1923; and Hudelson, 1923) remained 
in use until at least the 1930’s. A scale produced by Educational Testing Service 
(1957) was listed in the Sixth Mental Measurements Yearbook (Buros, 1965, p. 592). 


Nyberg and Nyberg (1977) developed two sets of essay models suited especially 
for grading compositions written by students at the grade 12 level in Alberta. The 
models were derived from essays written by students in English 30 in June, 1964. 
In that year, a random sample of 103 essays was drawn and graded by 21 raters for 
style and content, and by another 27 raters for English mechanics. The resulting 
data were used to determine the reliability of the essay grading procedures 
employed in connection with the Alberta departmental examinations (Nyberg, 
1966). From the sample of 103 essays, models representing stanines 2, 4, 6, and 8 
were chosen for style and content, and another set also representing stanines 2, 4, 6 
and 8 was selected for use in grading mechanics of English. Reasons for developing 
two scales instead of one, and details of how the scales were developed are provided 
in the technical report (Nyberg & Nyberg, 1978). 

When the two scales, style-content and mechanics, consisting of model essays, 
had been completed, the question remained as to whether they could be used by 
teachers to grade students’ essays with at least some degree of reliability. The 
study reported here was designed to answer this question. 


Procedure 


The general procedure was quite simple. A set of essays, written by English 30 
students, was rated independently by each of six teachers through use of the 
models. The mean correlation between all pairs of raters was then calculated. 

In April of 1977, four teachers administered an essay test to their English 30 
classes. The test consisted of writing a 200-word essay on the topic “The Ideal 
School.” A time limit of one hour was imposed. The students were from unselected 
classes; that is, they ranged in achievement and ability from low to high. In order to 
motivate the students to perform at their best, it was announced that the test 
results would contribute to their final course grades. Eighty-three essays were turned 
in for use in this study. 


Six experienced teachers of high school English each graded the full set of 
essays. Each of the teachers was provided with a copy of the models to be used for 
rating and each was asked to practise using the models as suggested in the scale 
booklet. Each rater then graded each essay twice — once for style-content, and 
once for mechanics. No marks were placed on the students’ test papers; instead, 
scores were entered on grading sheets retained by the raters. This procedure 
prevented a reader from knowing how any essay fared at the hands of the other 
readers. Each essay was identified by number only; therefore it was impossible for 
the rater to know who had written the essay or from which school it came. 

The correlation between scores awarded by each pair of raters was determined. 
The mean of all the correlations between pairs of raters was taken as an estimate of 
the reliability of the grading procedures. 
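
The procedure just described can be sketched in a few lines. This is a minimal illustration, not the computation used in the study: the function names are introduced here for convenience, and the ratings are hypothetical values for three raters and five essays rather than data from the 83 essays.

```python
from itertools import combinations
from math import sqrt


def pearson_r(x, y):
    """Pearson product-moment correlation between two equal-length lists."""
    n = len(x)
    mx, my = sum(x) / n, sum(y) / n
    sxy = sum((a - mx) * (b - my) for a, b in zip(x, y))
    sxx = sum((a - mx) ** 2 for a in x)
    syy = sum((b - my) ** 2 for b in y)
    return sxy / sqrt(sxx * syy)


def mean_pairwise_r(ratings):
    """Average the correlations over every pair of raters.

    `ratings` maps a rater label to that rater's scores, listed in the
    same essay order for every rater.
    """
    pairs = list(combinations(ratings, 2))
    return sum(pearson_r(ratings[a], ratings[b]) for a, b in pairs) / len(pairs)


# Hypothetical stanine scores from three raters on five essays.
ratings = {
    "rater_1": [2, 4, 5, 7, 8],
    "rater_2": [3, 4, 6, 6, 9],
    "rater_3": [2, 5, 5, 8, 7],
}
estimate = mean_pairwise_r(ratings)
print(f"mean correlation over all pairs of raters: {estimate:.3f}")
```

With six raters, as in this study, the same function would average over 15 pairs.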

The means and standard deviations of the scores awarded by the six raters over 
the 83 essays were calculated and then examined for differences. 


Results 


Table 1 shows the correlations between all pairs of raters for English mechanics 
and Table 2 shows the means and standard deviations. Tables 3 and 4 provide 
similar information for the style-content grades. 

The reliability of the scoring for mechanics, estimated by computing the mean 
of the correlations in Table 1, was found to be .782. For style-content, a reliability 
estimate of .772 was calculated. These values compare favourably with estimates 
from other studies. Standard errors of measurement were estimated to be .659 for 
style-content and .751 for mechanics. In the study from which the essay models 
were derived, Nyberg (1966) reported reliabilities of .768 for mechanics and .595 for 
style-content. It was concluded that a respectable reliability of scoring can be 
achieved through use of the essay models by teachers rating twelfth-grade essays. 
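
The report does not say how the standard errors of measurement were obtained. A conventional choice is the classical formula, the standard deviation of the scores multiplied by the square root of one minus the reliability, and the sketch below assumes that formula. The function name is introduced here for illustration, and the standard deviation of 1.38 is an assumed value, not one reported in the study.

```python
from math import sqrt


def standard_error_of_measurement(sd, reliability):
    """Classical SEM: score standard deviation times sqrt(1 - reliability)."""
    if not 0.0 <= reliability <= 1.0:
        raise ValueError("reliability must lie between 0 and 1")
    return sd * sqrt(1.0 - reliability)


# The reliability is the style-content estimate reported above; the standard
# deviation of 1.38 is an assumed, illustrative value, not a reported one.
sem = standard_error_of_measurement(sd=1.38, reliability=0.772)
print(f"SEM = {sem:.3f}")
```

With these inputs the result rounds to .659, in line with the reported style-content value, although that agreement does not establish which standard deviation the authors actually used.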

Analysis of variance indicated no significant differences (at the .05 level) among the 
raters when they graded for mechanics, but for the style-content component, the 
differences were highly significant. This meant that some of the raters consistently 
graded essays higher (or lower) than other raters on the style-content aspect. This 
is of no particular consequence to teachers dealing only with their own classes; 


TABLE 1 
CORRELATIONS BETWEEN MECHANICS SCORES AWARDED BY SIX RATERS 


                          Rater Number 

Rater Number      2        3        4              5        6 
     1          .949     .836     [illegible]    .880     .830 
     2                   .823     [illegible]    .821     .809 
     3                            .592           .738     .689 
     4                                           .837     .698 
     5                                                    .820 




TABLE 2 
MEANS AND STANDARD DEVIATIONS OF MECHANICS SCORES 


Rater Number Means Standard Deviation 
TABLE 3 
CORRELATIONS BETWEEN STYLE-CONTENT SCORES AWARDED BY SIX RATERS 


Rater Number 


Rater Number 2 3 4 5 6 
TABLE 4 
MEANS AND STANDARD DEVIATIONS OF STYLE-CONTENT SCORES 


Rater Number    Means          Standard Deviation 
     1          [illegible]    [illegible] 
     2          5.699          1.462 
     3          6.470          1.320 
     4          [illegible]    1.300 
     5          5.470          [illegible] 
     6          5.940          1.563 


however, in cases where the models are being used to grade essays for purposes of 
comparison where several raters are involved, measures would have to be taken to 
reduce or eliminate the differences. This might be done through discussions among 
the teachers to resolve the differences between the “hard” and the “soft” raters. In 
this study, there was no group discussion of the grading. A simple mathematical 
correction could be employed to eliminate differences among raters. Converting all 
ratings to T scores is one of the procedures that would serve the purpose. 
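
The T-score correction mentioned above can be sketched as follows. This is an illustration under stated assumptions, not the authors' procedure: each rater's scores are rescaled separately to a mean of 50 and a standard deviation of 10, the population standard deviation is used, and the two raters' stanine ratings are hypothetical.

```python
from statistics import mean, pstdev


def to_t_scores(raw_scores):
    """Rescale one rater's scores to mean 50 and standard deviation 10."""
    m = mean(raw_scores)
    s = pstdev(raw_scores)
    if s == 0:
        raise ValueError("all scores are identical; T scores are undefined")
    return [50 + 10 * (x - m) / s for x in raw_scores]


# Hypothetical stanine ratings of the same five essays by a "hard"
# and a "soft" rater (illustration only).
hard_rater = [2, 3, 4, 5, 6]
soft_rater = [4, 5, 6, 7, 8]

hard_t = to_t_scores(hard_rater)
soft_t = to_t_scores(soft_rater)
print([round(t, 1) for t in hard_t])
print([round(t, 1) for t in soft_t])
```

Because the soft rater here is simply two stanines more generous on every essay, the two sets of T scores coincide. The conversion removes differences in level and spread among raters; it does nothing about disagreements over how the essays should be ranked.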


Concluding Statement 


The reliability of scoring through use of the essay models was quite satisfactory 
in view of results obtained in other studies. Nyberg (1978, p. 8), after reviewing the 
literature, stated: “Reliability coefficients involving senior high school students and 
several classroom teachers as raters are seldom higher than .6.” This should not be 
taken to mean that the reliability coefficients were high. Values of .782 and .772 
would be considered weak in mathematics problem-solving tests or in multiple 
choice tests in, say, science. It must be emphasized, also, that this study 
investigated reliability of SCORING only, which must not be confused with TEST 
reliability. That is to say, the only source of unreliability that was studied here was 
within the readers, while test reliability includes variation from one situation to 
another. 


The primary value of the Alberta Essay Scales would be to assist the classroom 
teacher in day-to-day scoring of essays by providing a constant standard. The 
scoring procedures are not sufficiently reliable to permit decisions regarding 
individual students to be made on the basis of one essay. However, if, as is 
ordinarily the case, final grades are based on at least a dozen writing assignments, 
decisions for individuals would be reasonably sound. 


Verner R. Nyberg 
Department of Educational Psychology 
The University of Alberta 


and 


Adell M. Nyberg 
Spruce Grove Composite High School 


Copies of the Alberta Essay Scales are available from the Curriculum Branch, Alberta Education. 
BOOK REVIEWS 


THE REFORM OF TEACHER EDUCATION IN THE UNITED KINGDOM. By 
James Lynch. Guildford, Surrey: Society for Research into Higher Education, 
1979, 211 pp. (ISBN 0-900868-68-6) 


This is essentially a book about the role played in teacher education by the 
Council for National Academic Awards, and about the advantages of “a modular 
course system” for teacher training programmes. On these topics, the book is very 
well documented and provides, in text and appendices, detailed information that 
would be of use to course planners. 


The Council for National Academic Awards is a body set up by the Department 
of Education and Science for the express purpose of validating courses offered by 
institutions of higher education not entitled to validate their own, that is to say, 
non-university institutions. It is the keystone of the structure built up in the U.K. 
over the past twenty-five years to provide higher education outside the universities. 
Together the two systems constitute “the binary system” of higher education, and 
the key to the difference lies in the fact that universities in this country are funded 
by the University Grants Committee which draws its resources directly from the 
Treasury, whereas polytechnics and other non-university institutions are funded 
through channels negotiated by the Department of Education and Science with the 
local education authorities. The split began on a substantive basis, responsibility 
for the development of higher technical education being removed from the domain 
of the universities, which continued to be responsible for the development of 
academic studies. That difference has since been steadily eroded, as is indicated by 
the fact that the original National Council for Technological Awards, set up in 
1955, became in 1964 the National Council for Academic Awards. While the author 
of this book recognizes that central government has been involved in a “power 
game” (p. 2), he seems to see no threat in the take-over of teacher education under 
cover of the developing binary system. He indicates instead that government control
of manpower in the education system necessitated the move. 


A reader of Chapter 1 (the historical context) and Chapter 2 (contemporary 
developments) might be forgiven if he nurtured a growing sense that the reference 
to “reform” in the author’s title constituted a grand piece of irony. Predictions of 
sharply falling school enrolments are, of course, a major factor in the situation 
described, but there are many other sources of uncertainty and confusion, and the 
outcomes are vacillation, delay, panic action, administrative muddle, and a 
generally low morale. 


But with Chapter 3 (the expansion of C.N.A.A. activities), there is a marked 
change of tone: it is clear that the Fairy Prince has now entered upon the scene and 
the stage is set for the promised reform. The Fairy Prince is, of course, the Council 
for National Academic Awards, and it seems that the magic power by which he will 
deliver the system from bondage lies principally in provision for modular courses. 
Of the C.N.A.A. we are told: 


It has been argued that, in terms of its democratizing and academic impact, the Council 
represented an innovatory development which far surpassed the contribution of the new 
universities founded in the 1960s and even the much-vaunted Open University, and which 
evolved without the massive injections of capital that those institutions required. 
Moreover, its foundation marked the beginning of an important development in teacher 
education towards the sharing of responsibility, for it gradually weaned much of teacher 
education from the autonomous to the public sector of higher education, from an “élitist” 
towards a “mass” tradition of further and higher education. (p. 58) 


Chapter 4 spells out the curricular implications of this “innovatory 
development” and Chapter 5, its organizational implications. Chapter 6 is a brief 
look at the shape of things to come, and here an odd kind of Utopia comes into 
view: 


Increased control directly from central government and indirectly through the regional 
councils . . . will enable the central government to regulate not only teacher education 
numbers and courses but also other courses in higher education, and while it is not likely 
to stem the greater variety of course provision which has been stimulated by the entry of 
C.N.A.A. to the validation field, it will nonetheless have effects on the curricula of the 
institutions of higher education and also on the curricula of the schools; effects which 
remain as yet relatively unappreciated. Such developments carry important, if indirect, 
implications for the control of school curricula by central government, and thus also for 
the distribution of knowledge in society. (p. 140) 


Let me make it clear that I share the author’s respect for what the C.N.A.A. has 
achieved, and that I have for many years been concerned, as he is, to develop 
innovatory courses which would introduce a stronger element of professionalism 
into teacher education. That said, the biased nature of his account must be recognized.
What the universities have contributed, both through their education 
departments and more generally, and what colleges of education have achieved and 
continue to achieve in cooperation with the universities is almost totally ignored. 
The “waning influence” of universities is greeted as an aspect of reform, and the 
reader is led to believe that, thankfully, the university phase in teacher education is 
over and we need think no more about it. A writer who can equate university 
education with “élitism” and take-over by government as “democratization,” and 
who is disposed to conflate manpower planning with curricular control, is surely 
taking a myopic view of the role of education in society. Such a view of teacher 
education would have been challenged had he raised his eyes even so far as to 
glance at a neighbouring profession, that of the medical service. 

This book was originally written in German and published in Germany: it is to 
be regretted that so partial a view of teacher education in the United Kingdom 
should be exported, whether to Germany or to the American continent. 


James Britton 
Emeritus Professor of Education 
in the University of London 
TEACHING CANADA FOR THE ‘80s. By A. B. Hodgetts and Paul Gallagher. 
Toronto: The Ontario Institute for Studies in Education, 1978, 136 pp. 


Hodgetts and Gallagher indicate that their book is “presented to educators and 
all other interested Canadians in every part of the country as a guide — or at least 
a starting point — to teaching about Canada in the next decade” (p. 6). This guide 
is comprised of seven chapters in which the authors attempt to define the nature of 
Canada studies, outline possible content of Canada studies programs, and identify 
the unique roles of the elementary and secondary schools in achieving the goals of 
such programs. In addition, chapter seven sets forth a particular framework for 
approaching instruction of public issues in secondary school classrooms. How useful
the guide will be to educators with concerns in these areas is open to question. 


In attempting to arrive at a sound conceptual base for their scheme, the authors 
have run into definitional problems. The impression is left that there has been no 
real attempt to explore the idea of Canadian or Canada studies but that the 
authors, armed with a preset notion of what content students should acquire, have 
dispensed rather quickly and inadequately with the rationale for such studies. An 
example of the resultant confusion may be seen in conflicting statements such as: 


We believe that Canada studies — as distinct from all other more restricted Canadian 
studies should deal with our society as a totality, in country-wide, interlocking 
perspectives. (p. 2) 


followed only a few pages later by: 


A clear distinction may now be made between Canadian studies and Canada studies. The 
adjective “Canadian” described a much broader field of study, including all kinds of local 
or specialized concerns, as well as a wide range of human activities of no direct political 
significance. (p. 11) 


Further confusion is created when Canada studies are next equated with civic 
education which is “designed to encourage a development of the skilled and 
sensitive public opinion needed to resolve deep-seated differences in the Canadian 
political community” (p. 11). 


Semantics aside, the proposed Canada studies program seems limited in scope 
and richness. The social science perspective put forward lacks the human concern 
so evident in proposals such as that of Symons (1975). For Symons, it is important 
that Canadians develop self-knowledge, an understanding of themselves, who they 
are, where they are in time and space, where they have been, where they are going, 
what they possess, and what their responsibilities are to others. There is as much 
concern for Canadian art, literature, and philosophy as there is for the functioning 
of parliament and the mechanics of the economic system. Symons’ proposals stir 
the imagination and challenge the intellect. In contrast, the text-like proposals set 
forth in the book, which begin in the elementary school with a study of basic 
citizenship and Canadian communities, move through a concern with the “total 
environment of Canada” at the beginning of secondary school to the Canadian 
political and economic systems, and finally culminate with the examination of 
public issues in the last years of the secondary school, seem pedantic and 
uninspiring. 

The limitations of the range and scope of Canada studies proposed are coupled 
with an equally inadequate approach to knowledge. Despite notable contradictions 
within the text, the dominant view of knowledge is that described by Freire (1971) 
as the “banking concept” in which the teacher’s task is to deposit with the student 
what is considered to be the truth about Canada. Faith in this approach is 
characterized by statements such as: 


Yet it is primarily by raising the knowledge levels of our students that we may hope to 
achieve the kinds of understandings and respect for others that are the attributes of a 
civilized people. (p. 3) 


Pages 38 through 117 of the book follow this approach by outlining theories, 
themes, and generalizations which, the authors indicate, students should learn 
about Canada. There are close to 200 statements prefaced by the stem 
“Students should understand” or some equivalent variant indicating a specific 
piece of knowledge for which students should be held accountable. 


This view of knowledge, popularized by Bruner and so characteristic of the 
social studies projects of the early sixties, is bound to disappoint those who view 
such approaches to knowledge building as questionable. The knowledge base 
proposals not only seem to lack the support of many current theorists, but also to 
ignore the results of important contemporary studies in the area of civic education 
such as that of Torney, Oppenheim, and Farnen (1975) who have indicated that: 

The acquisition of knowledge does not correlate highly with support for democratic 
values. . . . Neither does knowledge of civics correlate highly with various measures of 
political participation. (p. 19) 

Thus, even if youngsters were to acquire the knowledge base outlined, the authors’ 
stated purpose of encouraging “Education for citizenship — for an active role in the 
life of society” (p. 19) would not seem to be assured. 


There is a sharp contrast between the program suggested for the elementary 
grades and that proposed for the secondary schools in the amount of specification 
provided by the authors. The elementary program outlined begins with primary 
grade children exploring areas such as community participation, the basic 
responsibilities of community members, and an examination of public controversy. 
Out of these studies, it is suggested, children will develop attitudes toward Canada 
rather than accumulate information. It is further argued that the present content 
of social studies programs in the primary grades is suitable — all that is required to 
turn it into a Canada studies program is a new emphasis on the local community as 
a dynamic social reality. 

In the middle and upper years of the elementary school, children will study 
community life in Canada and the world, Canada as a political community, Canada 
in the world community, participative citizenship, compromise and consensus, and 
representative government. There is also a plea for history and geography of 
Canada in the last years of the elementary school, but how this fits into the total 
scheme is left unclear. It is expected that from such a program children will emerge 
from the elementary school with “a real sense of personal identification with 
Canada as a whole,” and will be ready for “a more systematic study of the basic 
components of Canada studies” (p. 33). 

The junior high school is to be primarily devoted to a study of the Canadian 
Environment, focussing on the development of five major generalizations: 

1. Canada is a northern, vast, and regionally divided country. 
2. Canada has a broad natural resource base composed of both renewable and 
non-renewable resources. 
3. Canada is an industrial, technological, and urbanized society. 
4. Canada is a culturally diverse, multi-ethnic country with two historically dominant 
linguistic and cultural groups. 
5. Canada is exposed to a multitude of external economic, political, and cultural 
influences. (p. 12) 


By the end of the junior high school years, it is contended, students will have an 
understanding of the complexity of Canada and an awareness of the 
responsibilities of citizenship in operating the Canadian political community. 


Students in the ninth to twelfth grades, approximately, are to build upon this 
base knowledge of what Canada is, by moving to how Canada functions. This is to 
be accomplished through the examination of the systems that Canadians have 
developed, primarily the economic and political systems. Following this detailed 
study of systems, students move to a study of public issues as the climax to all of 
their preceding work on Canada studies. 


The rationale and suggestions for the elementary school program are dealt with 
in 24 pages, the early secondary school in 16 pages, while the middle and upper 
secondary programs are detailed in 75 pages. Those concerned with elementary 
education will probably find the suggestions interesting but superficial; those 
interested in secondary education will probably find the detailed suggestions 
highly restrictive and overprescriptive. All will likely find that the proposals provide
a limited explanatory framework of Canada and Canadian development. This 
is due in part to the interpretation of Canada within particular political and 
economic models and the extent to which human experience elsewhere is ignored. 


The proposed framework raises serious concerns about the nature of Canada 
studies and the imbalances that could occur in youngsters’ social learnings were 
such a plan to be followed in schools. The authors are aware that their scheme may 
be confining, at least in terms of geographic settings explored: 


It is legitimate to contend that an exclusive preoccupation with Canada and things 
Canadian throughout the elementary years is too narrow and chauvinistic. (p. 28) 


However, they do not seem to be sensitive to the possibility that the approach 
taken to knowledge and knowledge building may be equally restrictive. 


Teachers will find the model set forth in the last chapter the part of the book 
most readily put into practice. However, before doing so, they will want to consider 
carefully the inquiry model proposed. It consists of a series of six questions set 
forth under three headings, Contemporary Analysis, Historical Analysis, and 
Future Analysis. It is suggested that this is “a total package for studying any major 
public issue in Canada” (p. 123). 


The assumptions underlying the model have not been set forth by the authors. 
Anyone questioning the effect that extended use of such a model might have on 
students would do well to consider that it seems to support the following:
(1) Human problems can be reduced and managed with the application of a 
particular inquiry model, (2) If only the “correct” questions are asked then issue 
resolution will be assured, (3) Students are incapable of exploring issues without 
the aid of a recipe, (4) One model for inquiry into issues is satisfactory, (5) The 
scientific-technical model proposed is appropriate for all issues, (6) Application of 
the model absolves students of any further intellectual responsibility, (7) Issue 
exploration for students does not proceed past the appraisive level. Many will find 
these underlying assumptions worthy of examination before proceeding to use the 
model in classrooms. 

One of the other obvious shortcomings of the model is that it makes no attempt 
to alert students to the methodology-in-use. The model lacks what Goldmark 
(1968) has labelled as inquiry-into-inquiry. This essential component of knowledge 
building asks that learners explore what they did, how they did it, why they did it 
that way, and if they should do it that way in the process of problem-solving. This 
inquiry into the methodology used to explore issues would seem necessary if 
students are to do other than to apply unwittingly, in a mechanistic manner, 
questions which will eventually influence what they are able to think and the very 
way in which they will ultimately structure their own reality. 


Advocacy of Canadian content in school programs boasts wide and vocal support.
It represents a concern over the lack of a national consciousness and a 
struggle for a distinctive Canadian identity free from colonial or neo-colonial 
influence. Teaching Canada for the ’80s represents one of the few attempts to describe
a comprehensive program for schools in this area. It would be unfortunate if, 
in our zest to be Canadian, the obligation to be scholarly were compromised. 


D. Massey 
Department of Elementary Education 
The University of Alberta 
EDUCATIONAL PSYCHOLOGY. By David W. Johnson. Englewood Cliffs: 
Prentice Hall, 1979, 592 pp., $12.95 (U.S.). 


The three main subdivisions of this omnibus introductory text in educational 
psychology are before teaching, during teaching, and after teaching. Four chapters 
each are devoted to before and after teaching, while nine chapters outline the 
psychological concepts of importance during teaching. The before teaching section 
sets out the scope of educational psychology, the crisis in educational goals, 
cognitive and intellectual development, and socialization and social development. 
Major theorists and their theories are set out in sufficient detail for beginning 
students. The nine-chapter section dealing with during teaching outlines in rather 
pragmatic form the problems involved in specifying learning goals and outlines 
both the cognitive and the behavioural approaches to instruction. Johnson also 
deals with attitude change, motivation, and learning environments to ensure skills 
and abilities are available to users of his text. He specifically deals with conflicts 
and classroom control and discipline, topics that are often sparsely treated in many 
educational psychology texts. A section dealing with students who have special 
needs, while not too appropriate for special education students, is quite adequate 
for the classroom teacher. The after teaching section deals with problems in 
evaluating students, evaluating instructional effectiveness, a sociologically oriented 
chapter dealing with improving your school and school district, and finally a 
synoptic statement dealing with your career as a teacher. 

Johnson states that the book does not assume any prior knowledge of 
psychology, that the approach is eclectic with differing points of view presented 
and no single stance being adopted. There appears, however, to be a 
social-psychological stance that runs throughout the entire book. This should not 
be surprising with Johnson’s interest in the areas of social psychology and 
interpersonal relations. While there is a claim that the units are self-contained, 
there are circumstances that require and/or direct the reader back to earlier 
sections of the book. The group cooperative activities approach to instruction and 
learning — one that is endorsed within the text itself — is obvious from the many 
student activities that are presented. This represents a serious attempt to bridge 
the gap between theory and practice and is accomplished quite successfully. No 
student workbook is needed, which is a positive feature. An Instructor’s Manual is 
apparently available, but it was not received for this review. Johnson’s approach is 
based upon 14 years of teaching experience and other professional work that he has 
been associated with. Other positive points include the highlighting of key concepts 
during printing, and the inclusion of incidental information relating to several 
topics under discussion that is not critical to the textual material, but certainly 
does expand upon the concept for those interested. Group activities spaced 
throughout the book require the students to clarify their own concepts and to 
reconcile their concepts with the theoretical statements that they are studying. 
Chapter summaries are also included. 


Despite many of the claims by the author, there are some obvious attempts to 
present a particular point of view. For example, Gagné’s taxonomy is presented as the only 
usable structure for stating educational objectives. Bloom is given passing mention, 
but no attempt has been made to place his work in historical perspective. Missing 
also is the listing of the Taxonomy of Educational Objectives: Handbook II, and 
Harrow’s 1972 work on A Taxonomy of the Psychomotor Domain. 


The after teaching section of the book dealing with evaluating students should 
be improved. Measurement and evaluation are characterized as unpleasant tasks 
due to negative student reaction, yet properly handled they can be an asset to any 
teacher and will be appreciated by the students. Without evaluation, students 
would receive no feedback and would be unable to assess their relative position 
within the school setting — something they and their parents insist upon doing. 
The section dealing with statistics is far too short — a separate appendix would be 
more appropriate — and contains an incorrect definition of the standard deviation. 
Technical qualities of instruments are stated in order of importance — reliability, 
validity, objectivity, practicality and discrimination. Many writers would instead 
list validity, reliability and usability. 

Standardized tests are covered in some detail, but the standard reference work, 
O. K. Buros’s Mental Measurements Yearbook in its seven editions, is not 
mentioned, nor is it included in the references. This omission is a serious error 
as without knowledge of the Buros publication, choosing a standardized test 
efficiently is virtually impossible. A brief section on teacher-made tests includes 
suggestions for item construction. The matching item on p. 468, however, ignores 
this advice. 

Since 1968, much attention has been devoted to the emerging concepts involved 
in educational evaluation as exemplified in the more recent publications of R. 
Stake, D. Stufflebeam, M. Scriven, and M. Provus. This entire field of study has 
simply been omitted and as it constitutes a significant area of study today, it 
should be included in an introductory text. Many improvements that have 
occurred in the school system can be attributed to evaluation studies and this 
makes the area too important to ignore. 


On balance, the first thirteen chapters (before and during teaching sections) are 
for the most part quite appropriate for an introductory course. There is a glossary 
which includes most of the psychological terms the student encounters, as well as a 
References section that lists over 750 books and articles. For most beginning 
students this unselected reference list will be somewhat formidable, but many 
valuable publications are listed therein. There is also a workable index. Johnson 
writes in a very readable fashion, and if you subscribe to his social-psychological 
approach, you will find much of value for your students. 

Frederick C. Rankine 
Professor of Education 
University of New Brunswick 
CHARLES K. CURTIS 
The University of British Columbia 


Developing Critical Thinking Skills in 
Nonacademic Social Studies Classes 


Frequent references to critical thinking as an objective of social studies instruction 
are found in the literature. This paper reports the findings of a study intended to 
teach critical thinking skills in nonacademic secondary classes. Two research 
designs, the nonequivalent control group design and the pretest-posttest design, 
were used in the study. Students in the experimental classes and in the classes 
included in the one-group, pretest-posttest design were taught the skills identified 
in Ennis’ model of critical thinking as they investigated a contemporary community 
problem. Students in the control classes followed the prescribed social studies 
programs. A significant difference in favour of the experimental group was found 
between the adjusted mean posttest scores of the experimental and control groups 
on the Cornell Critical Thinking Test. The difference between the pretest and 
posttest mean scores on this instrument was significant for the students in the 
one-group, pretest-posttest design. Geographic location and sex of the students 
were not significant factors in the study. 


Concern with critical thinking as an objective of public school education dates back 
at least to the period when the philosophy of John Dewey was significantly influencing 
educational practice. Contemporary social studies literature abounds with references to 
the commitment of a democratic society to instruct its youth in the intellectual skills 
deemed essential for making rational decisions about societal problems. Despite this 
apparent interest, there is a paucity of literature describing research studies that have 
attempted to identify effective strategies for teaching critical thinking skills in social 
studies classes, a paradoxical situation that has been noted by, among others, Ellis 
(1942), Harris (1960), Shaver (1962), and Ennis (1963). This article reports a study 
designed to test the efficacy of a program for teaching critical thinking to 
slow-learning secondary students. 


doctorate in Special Education at Utah State University. For his dissertation, which was concerned with citizenship education 


Review of Literature 


Critical thinking was defined by Ennis (1962) to be the “correct assessing of 
statements” (p. 83). Initially, he described critical thinking as consisting of 12 aspects, 
reduced in a later article (Ennis, 1964) to nine. The critical thinker was characterized 
as possessing proficiency in judging whether (1) a statement follows from the premises, 
(2) something is an assumption, (3) an observation statement is reliable, (4) a simple 
generalization is warranted, (5) an hypothesis is warranted, (6) a theory is warranted, 
(7) an argument depends on an ambiguity, (8) a statement is overvague or 
overspecific, and (9) an alleged authority is reliable. A tenth proficiency was added in 
1971: judging whether a reason is relevant (Ennis & Millman, 1971, p. 1). 


The concept of critical thinking that Ennis proposed was comprised of three basic 
factors or dimensions: the logical, critical and pragmatic dimensions. According to 
Ennis (1964), the determination of these dimensions and the identification of the 
several aspects resulted from a “study of the literature in education, philosophy, and 
psychology” (p. 599). Although this model has been criticized (e.g., Berlak, 1965; 
Shaver & Berlak, 1968, p. 339), the proficiencies that Ennis enumerated correspond 
for the most part with the sets of critical thinking skills that have been accepted by 
social studies educators (e.g., Aldrich, 1948; Decaroli, 1973; Henderson, 1950; Morse & 
McCune, 1971). 


There have been few research studies concerned with the development of critical 
thinking skills in social studies classes, and those studies have frequently produced 
inconsistent and inconclusive findings. Nevertheless, suggestions for teaching critical 
thinking skills can be inferred from several pieces of social studies and related 
research. Among these suggestions are the following: First, increases in critical 
thinking are not likely to occur as a by-product of instruction aimed at other 
objectives (Barratt, 1965; Henderson, 1950; Kemp, 1963; Hyram, 1957). Second, 
teaching strategies that focus on the teaching of specific critical thinking skills as 
elements in a model for investigating problems have been successful (Armstrong, 1971; 
Cousins, 1963; Lee, 1968; McGarry, 1962; Yost, 1973). Third, the subject matter used 
to teach critical thinking has included the study of current events and contemporary 
problems and issues (Anderson, Marcham & Dunn, 1944; Creutz & Gezi, 1965; Miller 
& Weston, 1947; Oliver & Shaver, 1966). Fourth, there is evidence to support the 
contention that children in the elementary grades can be taught critical thinking skills 
(Boedeker, 1971; Brown, 1969; Hunkins & Shapiro, 1967; Hyram, 1957; Lee, 1968; 
Pitts-Scarangello, 1972). 


Method 
Sample 


The high school students (N = 225) involved in the present study were enrolled in 
special classes for slow learners and nonachievers, and in nonacademic, 
vocationally-oriented programs in eight schools located in six school districts 
throughout British Columbia. The ages of the students ranged from 15 to 21 years, 
and mean ages for the classes ranged from 15.5 to 18.3 years. A comparison of these 
mean ages with the mean ages for comparable grades revealed that project students 
tended to be at least two years older than their peers in the academic classes. The 
percentage of boys (47.1%) was not significantly different from the percentage of girls 
(52.9%). The school districts, selected for inclusion in the study by the use of a table 
of random numbers, varied in geographic size, numbers of schools, teachers, and 
students. The communities in which the project schools were located varied in size and 
degree of urbanity, from the Greater Vancouver region to small rural communities in 
the interior of the province. 
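
The statement above that the percentages of boys and girls did not differ significantly can be checked informally with a normal-approximation test of a single proportion against .50. The Python sketch below is one plausible way to do this and is not necessarily the test used in the study; the count of boys is inferred from the reported N of 225 and the reported 47.1%, and the function name is hypothetical.

```python
import math


def two_sided_proportion_test(successes, n, p0=0.5):
    """Normal-approximation z test of an observed proportion against p0."""
    z = (successes - n * p0) / math.sqrt(n * p0 * (1 - p0))
    p_value = math.erfc(abs(z) / math.sqrt(2))  # two-sided tail area
    return z, p_value


n_students = 225                    # N reported in the text
n_boys = round(0.471 * n_students)  # inferred count, not reported directly

z, p = two_sided_proportion_test(n_boys, n_students)
print(n_boys, round(z, 2), round(p, 2))
```

With these inferred counts the absolute value of z falls well short of 1.96, which agrees with the report of no significant difference at the .05 level.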


Program 


The treatment program was developed from inferences based on the preceding 
research studies of critical thinking. During the four-month period of the project, 
students in the experimental classes employed an inquiry problem-solving model to 
investigate the problems and issues related to the conditions of housing in their 
communities. In particular, student inquiry addressed the following questions: Does a 
housing problem exist in your community? What factors affect the cost of housing? 
What can be done to lower the cost of housing? What can be done to provide more 
rental accommodations? What are the rights and responsibilities of renters and 
owners? What should be done about the housing problem? and, What are the 
appropriate methods for taking social action in a democratic society? A variety of 
reading selections that included articles from newspapers and magazines, government 
publications, and brochures from political parties and special interest groups were 
provided to each class. Additionally, field studies and interviews with local resource 
persons were sources of relevant data for the students in their communities. 


The introductory lessons consisted of student surveys of the housing market in 
each project community. These surveys, lasting up to three weeks in most cases, 
concluded with the students in each class deciding that a housing problem did exist 
for many people in their community. 

The second part of the housing study pertained to the investigation of the several 
factors that affect the costs of housing. This was divided into three sections: critical
thinking lessons; reading selections and graphic materials dealing with the conditions
that individuals and special interest groups purported to result in expensive housing;
and interviews with local resource persons.

During the first critical thinking lesson, students were introduced to a general 
concept of critical thinking, and a rationale for developing critical thinking skills in 
the social studies program was discussed. The following lesson had the students 
consider a modified list of the critical thinking skills enumerated by Ennis (1962). 
Additionally, students were taught to use evaluative criteria (selected from Chapman 
& Gross, 1973, p. 190; Chausow, 1965, pp. 79-81; Fraenkel, 1973, p. 263; Fraser & 
West, 1962, p. 281; Quillen & Hanna, 1948, pp. 334-336; Shaver & Larkins, 1973) for 
making the judgments in Ennis’ list. 

Subsequent to the introductory lessons, students listened to a tape recording of a 
radio interview conducted by Vancouver broadcaster Jack Webster, CJOR, with Bruce 
York. York, as President of the British Columbia Tenants Association, had been 
invited on the program to comment about the B.C. Landlord and Tenant Act (June 
20, 1974). The two and one-half hour interview had been edited by the investigator to 
25 minutes, and it included a heated discussion about the feasibility of rent controls, 
the interest rates charged by mortgage companies, and the need for government 
intervention in the housing market. Telephone calls from three listeners were also 
included on the tape. To complete the work sheet accompanying the tape, students 
identified, weighed, and evaluated arguments by both the interviewer and interviewee, 
searched out words and phrases that were emotionally loaded, defined terms in 
arguments, located irrelevant questions and comments, found examples of fact and 
opinion statements, and evaluated analogies. In the majority of classes, the analysis 
and discussion of the broadcast was carried on over a minimum of three periods,
during which time the tape was played several times.

The analysis of the radio broadcast was followed by several lessons that consisted 
of an examination of They Build Houses, Don’t They?, a comic book published by the 
Exploding Myths Comic Collection of Toronto, Ontario. The assignment for these 
lessons included questions that had students study frames in order to locate evidence 
of the cartoonist’s bias, identify his point of view, and evaluate his arguments. 
Additional activities had the students speculate as to how the interviewee on the 
radio broadcast discussed earlier might react to the way the housing problem was 
described in the comic book, and as to how landlords, developers, renters, and buyers 
would have been represented had the comic book been prepared by realtors. Finally, 
the students were required to evaluate the comic book and decide whether it 
presented an accurate assessment of the housing problem and to provide evidence to 
support their opinion. 

When presented with data in succeeding lessons, students were encouraged to note 
the source of the information, suggest the frame of reference of the author, and search 
the material for evidence of bias, ambiguity, and contradiction. In addition, they were 
to judge the authoritativeness of the data and the validity of the arguments presented. 


The Control Class 


During the period of the housing study, students in the control classes were 
taught the regular social studies courses from the provincial curriculum guide. In 
schools 1 and 2, instruction consisted of a chapter by chapter study of a textbook on 
urbanization in Canada. In both control classes, questions that required students to 
locate facts, identify main ideas, and reach conclusions were assigned for each chapter. 
Additionally, students in these classes completed a major project — in most cases, a 
model or diagram illustrating some aspect of urbanization. In school 3, social studies 
for the control class centered on a chronological study of Canadian history. During the 
several months required by the experimental students for the housing project, the 
control students progressed in their study from New France, through the British 
Conquest and the establishment of Upper and Lower Canada, to the British North 
America Act of 1867. In addition to class discussions, learning experiences included 
answering questions (similar to those mentioned above) for each chapter of the 
textbook. This approach was also used in school 4. The control class in this school 
studied industrialization in the Soviet Union and the European Common Market. 


Design and Instrumentation 


In four of the eight project schools, two classes were available for inclusion in the 
study. In these schools, class placement in either the experimental or control 
categories was dependent upon the chance toss of a coin. In each of the remaining 
four schools, a single class only was available for the study.¹ Two research designs were
employed. The first, a nonequivalent control group design, was used in the schools 
where both the experimental and control classes existed. The second, a one-group 
pretest-posttest design, was used in the single class schools. Because of the obvious
inadequacies of the latter design, that part of the study was regarded as providing
data which, if in accord with the findings from the nonequivalent control group design,
would replicate and substantiate them.


Critical thinking was assessed by the pretest and posttest administration² of the
Cornell Critical Thinking Test (CCTT). This instrument consists of 72 items organized 
into four sections that purport to assess competency in most of the critical thinking 
skills listed previously (Ennis & Millman, 1971, p. 3). 


The validity of the CCTT was dealt with by Ennis and Millman (1971, pp. 12-33) 
through a series of correlations between scores on the CCTT and scores on measures
of intelligence and of academic achievement. For this study, Pearson product-moment
r’s were calculated between pretest scores for the project students and IQ scores on
the Otis Quick-Scoring Mental Ability Test (r = .547; df = 159; p<.01), and between
pretest scores and reading comprehension scores from the Gates-MacGinitie Reading
Test (r = [illegible]; df = 221; p<.01).

Two reliability coefficients were computed for the project data. The 
Kuder-Richardson 20 formula yielded a coefficient of internal consistency of .91 for 
the pooled classes on the pretest administration of the CCTT. A reliability estimate of 
.87 was calculated using the procedure suggested by Winer (1971, pp. 283-289). 
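
The Kuder-Richardson 20 coefficient reported above can be illustrated with a minimal Python sketch of the formula KR-20 = [k/(k-1)][1 - (sum of pq)/(variance of total scores)]. The response matrix below is simulated and purely hypothetical, since the item-level CCTT data are not reproduced in this report; the function name `kr20` and the use of the population variance for the total scores are assumptions of the sketch, not details taken from the study.

```python
import numpy as np

def kr20(item_scores: np.ndarray) -> float:
    """Kuder-Richardson 20 for a (students x items) matrix of 0/1 scores."""
    k = item_scores.shape[1]                 # number of items
    p = item_scores.mean(axis=0)             # proportion correct on each item
    q = 1.0 - p                              # proportion incorrect on each item
    # Population variance (ddof=0) of total scores, matching p*q item variances.
    total_var = item_scores.sum(axis=1).var(ddof=0)
    return (k / (k - 1)) * (1.0 - (p * q).sum() / total_var)

# Hypothetical data: 200 simulated students answering 72 dichotomous items.
rng = np.random.default_rng(0)
ability = rng.normal(size=(200, 1))
difficulty = rng.normal(size=(1, 72))
prob_correct = 1.0 / (1.0 + np.exp(-(ability - difficulty)))
responses = (rng.random((200, 72)) < prob_correct).astype(int)

# Limiting case: every student answers all items alike, so KR-20 equals 1.
perfect = np.array([[1, 1, 1], [0, 0, 0], [1, 1, 1], [0, 0, 0]])

print(f"KR-20 for simulated responses: {kr20(responses):.2f}")
print(f"KR-20 for perfectly consistent items: {kr20(perfect):.2f}")
```

The coefficient rises as items rank students more consistently, which is why it is read as an index of internal consistency.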


Analysis 

Two-way classification analysis of covariance with the data arranged according to 
treatment and geographic location, and with IQ and pretest scores as covariates, was 
the basic analysis used for the data from the nonequivalent control group design. The 
general least squares solution for unequal-sized groups was used, as recommended by 
Winer (1971, p. 498). The t test for correlated data was used to test pretest-posttest
differences for the one-group design data. 
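
The logic of a least squares analysis of covariance can be sketched as a comparison of two regression models, one with the covariates only and one with the covariates plus a treatment indicator. The Python example below is a simplified, hypothetical illustration: it uses simulated scores, a single treatment factor, and no geographic location factor or interaction term, so it is not a reconstruction of the analysis reported here. The function names `sse` and `ancova_treatment_f` are invented for the sketch.

```python
import numpy as np

def sse(X: np.ndarray, y: np.ndarray) -> float:
    """Residual sum of squares from an ordinary least squares fit of y on X."""
    beta, *_ = np.linalg.lstsq(X, y, rcond=None)
    resid = y - X @ beta
    return float(resid @ resid)

def ancova_treatment_f(post, treat, covariates):
    """F ratio (1 df) for a treatment dummy after adjusting for covariates."""
    n = len(post)
    ones = np.ones((n, 1))
    reduced = np.hstack([ones, covariates])              # intercept + covariates
    full = np.hstack([reduced, treat.reshape(-1, 1)])    # ... + treatment dummy
    sse_reduced, sse_full = sse(reduced, post), sse(full, post)
    df_error = n - full.shape[1]
    f_ratio = (sse_reduced - sse_full) / (sse_full / df_error)
    return f_ratio, df_error

# Hypothetical data: 70 experimental and 63 control students.
rng = np.random.default_rng(1)
n = 133
iq = rng.normal(105, 11, n)
pre = rng.normal(18, 11, n)
treat = (np.arange(n) < 70).astype(float)
post = 2.0 + 0.05 * iq + 0.6 * pre + 4.0 * treat + rng.normal(0, 7, n)

f_ratio, df_error = ancova_treatment_f(post, treat, np.column_stack([iq, pre]))
print(f"F(1, {df_error}) = {f_ratio:.2f}")
```

Fitting both models by least squares handles unequal group sizes directly, which is the practical reason for preferring this solution with intact classes.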

Additional analyses were run to determine whether sex was a significant factor in 
the study. The general least squares solution was used to analyze the data from the 
nonequivalent control group. A one-way classification analysis of covariance (with 
pretest scores as covariates³) was used with the data collected from the one-group,
pretest-posttest design arranged according to sex. 


Results 


Nonequivalent Control Group Design 

As can be seen in Table 1, the difference between the adjusted mean posttest 
scores for treatment was statistically significant (p<.01) in favour of the experimental 
group. F ratios for “geographic location” and “interaction” were not significant. 


TABLE 1


ANALYSIS OF COVARIANCEᵃ FOR THE ADJUSTED MEAN POSTTEST
SCORES ON THE CCTT OF THE EXPERIMENTAL AND CONTROL GROUPS


Source of Variation    Degrees of    Sums of      Mean      F Ratio    p
                       Freedom       Squares      Square

Treatment                   1          451.04     451.04      7.69     <.01
Geographic Location         3           66.00      22.20       .38     >.05
Interaction                 3           75.01      25.00       .43     >.05
Error                     121ᵇ       7,100.79      58.68
Total                     130       16,878.84


ᵃ A test for homogeneity of regression yielded an F ratio of .43 (df = 2, 127; p>.05), indicating that
differences between regression slopes were not statistically significant.
ᵇ ANCOVA model used for least squares analysis considers covariates as sources of variation.


Criterion and control variable means for the pooled data for the experimental and
control groups are given in Table 2. Table 3 contains the adjusted mean posttest 
scores arranged according to geographic location. 


TABLE 2


CRITERION AND CONTROL VARIABLE MEANS
AND STANDARD DEVIATIONS FOR THE SCORES
OF THE EXPERIMENTAL AND CONTROL GROUPS ON THE CCTT


                        IQ                          Pretest                 Posttest          Adjusted
Group         Number    Mean         S.D.           Mean         S.D.       Mean     S.D.     Posttest Mean
Experimental    70      [illegible]  [illegible]    [illegible]  11.86      24.96    11.30    [illegible]
Control         63      105.43       10.68          [illegible]  10.43      25.13    11.40    22.96


TABLE 3


ADJUSTED MEAN POSTTEST SCORES ON THE CCTT
FOR THE EXPERIMENTAL AND CONTROL GROUPS
ARRANGED ACCORDING TO GEOGRAPHIC LOCATION


Geographic
Location    Number    Experimental Group    Control Group    Row Means
1             38          28.25                 22.02            25.14
2             34          [illegible]           25.02            26.07
3             23          25.69                 21.92            23.80
4             38          27.40                 22.87            25.14


An Eta² of .03 was calculated by dividing the sum of squares due to the treatment
by the total sum of squares (Ferguson, 1976, pp. 236-237). This indicated the 
proportion of variance on the posttest scores attributed to the treatment category after 
controlling for geographic location, and pretest and IQ scores. 
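
As a worked check of the arithmetic, the short Python sketch below recomputes the error mean square, the treatment F ratio, and Eta² from the sums of squares and degrees of freedom printed in Table 1. The variable names are illustrative only.

```python
# Values as printed in Table 1.
ss_treatment = 451.04
ss_error = 7100.79
ss_total = 16878.84
df_treatment, df_error = 1, 121

ms_treatment = ss_treatment / df_treatment
ms_error = ss_error / df_error
f_ratio = ms_treatment / ms_error          # treatment mean square over error mean square
eta_squared = ss_treatment / ss_total      # share of total variation due to treatment

print(f"MS error    = {ms_error:.2f}")     # 58.68
print(f"F ratio     = {f_ratio:.2f}")      # 7.69
print(f"Eta squared = {eta_squared:.3f}")  # 0.027
```

The unrounded value of roughly .027 corresponds to the reported Eta² of .03, that is, about 3 percent of the total variation in posttest scores.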


One-group Pretest-posttest Design 

With a difference of six points between the means of the pretest and posttest 
scores (Table 4) and a t value of 4.72 (p<.01), it was concluded that the increase in 
critical thinking scores was greater than would be expected to occur by chance 
fluctuation alone.


TABLE 4


T TEST OF THE COMPARISON OF THE PRETEST AND POSTTEST
MEAN SCORES ON THE CCTT FOR THE STUDENTS
IN THE ONE-GROUP PRETEST-POSTTEST DESIGN


          Degrees of    Pretest          Posttest
Number    Freedom       Mean     S.D.    Mean     S.D.     t Value    p
66        65            15.41    9.67    21.65    10.75    4.72       <.01


The pretest and posttest mean scores for the individual classes in the one-group, 
pretest-posttest design are given in Table 5. 


TABLE 5


PRETEST AND POSTTEST MEAN SCORES
AND STANDARD DEVIATIONS FOR THE CCTT
FOR THE STUDENTS IN THE ONE-GROUP PRETEST-POSTTEST DESIGN


                    Pretest                Posttest
School    Number    Mean           S.D.    Mean     S.D.
5         20        15.90          9.56    24.40    10.17
6         20        [illegible]    9.20    22.60    12.36
7         20        19.80          8.84    22.60    14.59
8          6         4.00          2.97     6.17     4.12
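
The class-level posttest means in Table 5 can be checked against the pooled posttest mean in Table 4 by weighting each class mean by its class size. The following Python lines are a small arithmetic sketch using only the values printed in the two tables; the variable names are illustrative.

```python
# Class sizes and posttest means for schools 5 to 8, as printed in Table 5.
class_sizes = [20, 20, 20, 6]
posttest_means = [24.40, 22.60, 22.60, 6.17]

total_students = sum(class_sizes)
pooled_posttest = sum(n * m for n, m in zip(class_sizes, posttest_means)) / total_students

print(total_students)                 # 66
print(f"{pooled_posttest:.2f}")       # 21.65
```

The weighted mean agrees with the pooled posttest mean of 21.65 for the 66 students in Table 4.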


Sex 


The analysis of scores on the CCTT collected from the nonequivalent control 
group and arranged according to treatment and sex yielded nonsignificant F ratios for 
both sex (F = 1.31; df = 1, 130; p>.05) and the interaction effect of treatment and
sex (F = .15; df = 1, 130; p>.05). A nonsignificant F ratio (F = .55; df = 1, 64; 
p>.05) was also generated by the analysis of the data from the one-group, 
pretest-posttest design. 


Discussion 


The conclusion that the treatment program had a significant effect upon critical 
thinking skills resulted from the analyses of the responses to the items on the CCTT 
that revealed significant differences between the adjusted mean posttest scores in 
favour of the experimental students in the nonequivalent control group design, and 
between the pretest and posttest mean scores for the students in the one-group, 
pretest-posttest design. This conclusion supported the findings of previous research 
pertaining to critical thinking. That is, it indicated that a program consisting of 
instruction and practice in specific critical thinking skills, within the context of an 
investigation of a contemporary community problem, was a more effective approach 
for teaching critical thinking than was conventional subject matter oriented 
instruction. The Eta² of .03, however, may cause some readers to question the
educational significance of the conclusion.⁴


Conclusion


The evidence from this study suggests that a program consisting of specially
prepared lessons designed to teach specific critical thinking skills as part of an inquiry
into a community problem is appropriate pedagogy for teaching critical thinking in 
nonacademic social studies classes. The data also suggest that the adequacy of such a 
strategy is unaffected either by the place of student residence or by the sex of the 
students. Furthermore, the evidence implies that critical thinking skills tend not to be 
developed in nonacademic classes through instruction in the prescribed social studies 
curricula. 


Notes 


1. In small communities in the coastal and interior regions of British Columbia, the entire school population of slow
learners or nonacademic students for a particular grade may be enrolled in a single class.
2. The length of the period between the administration of the pre- and posttests was approximately five months.
3. IQ scores were not available for the students in the one-group, pretest-posttest design.
4. A perusal of reports of critical thinking research in educational journals for the period 1949-1978 uncovered only
three studies for which ANOVA or ANCOVA tables were available. Each study recorded statistically significant F
ratios for the treatment variable. Eta²s of .01 (Hardy, 1971), .02 (Wallen, Haubrich & Reid, 1963), and .08 (Wood,
Bradac, Barnhart & Kraft, 1970) were calculated from the data contained in these tables.


Social Background, Aspirations and
Academic Achievement:
An Analysis Based on Longitudinal Data for Australia


A model of the social processes involved in the production of individual educational 
achievement is formulated and used to interpret data for Australia. The model is 
based on that proposed by Marjoribanks (1975) and draws on an analytical theory 
of social interaction proposed by Harre and Secord (1972). Both the “needs press” 
paradigm and “social learning theory” are discussed as mechanisms of transference 
in the model, although the data do not allow a test of the validity of these two 
alternatives. Data collected for Australia as part of the International Association 
for the Evaluation of Educational Achievement Six Subject Survey in 1970 plus a 
follow-up survey of students two years later are employed to estimate the model. 


Recent research into the determinants of academic achievement has employed 
relatively few conceptual paradigms as the vehicle of investigation. The most favoured
recently has been the “needs-press” paradigm. This paradigm was developed out of a
theoretical perspective formulated by Bloom (1964) and has been applied to research
data by Dave (1963), Wolf (1964), Keeves (1972), and Marjoribanks (1972), among
others. This model can be represented by the equation $A_2 = A_1 + f(E_{12})$, where $A_1$ and
$A_2$ represent measures of achievement at two points in time and $E_{12}$ represents the
influence of environment acting on the individual between these two time points. “The 
environment is conceptualised in terms of a network of forces which exert press upon 
the individual, with the individual’s cognitive performance being a response to the 
environmental press” (Marjoribanks, 1975, p. 156). 

This paradigm may be viewed as the internal workings of a wider theoretical 
perspective of the phenomenon, the cultural deficit thesis. This thesis argues that it is 
the deficits in the social and cultural backgrounds of individuals which are responsible 
for differences in achievement. Williams (1974) has argued for an alternative 
mechanism of transference in this model: one based on “social learning theory.” He
suggests part of the variability in children’s educational performance which can be 
ascribed to between-family variability in environments is a function of: 


1.  between-family variability in the kinds of (cognitive) stimuli parents provide in 
the form of opportunities to interact with a wide range of physical stimuli, and in 
the form of opportunities to interact with appropriate models for these 
behaviours; 


2.  between-family variability in the nature of reinforcement for appropriate 
performance, and in the reinforcement contingencies and schedules parents use; 
and 


3. between-family variability in the encouragement parents provide for appropriate 
performance, and in the expectations they hold out for the child’s level of 
performance on these behaviours. 


Marjoribanks (1975) proposed a model for the analysis of this process, based on an 
analytical theory of social interaction provided by Harre and Secord (1972). They take 
as their focal active unit the individual, who consciously acts in such a way as to guide 
his own behaviour toward some explicit goal. Action by the individual is thus seen as 
deliberate and goal specific. Harre and Secord have labelled the conditions which are 
required for individuals to have the power to monitor their own performance, the 
“enabling conditions.” Within the context of the needs-press paradigm these enabling 
conditions can be viewed as the behavioural manifestations emanating from the 
presses for achievement, motivation, independence, intellectuality, etc., and in the 
context of the social learning paradigm they are the behavioural manifestations 
emanating from cognitive stimuli, reinforcement schedules, and 
encouragement/expectations held out by parents and teachers. 


In the context of the present study these enabling conditions are postulated to 
consist of characteristics such as attitudes toward school and educational and 
occupational aspirations and expectations. 

The model employed here takes individual cultural deficits and mediates their 
effect on achievement via these enabling conditions. Such a model may be represented 
as: 

$$A_{t_2} = f(b_1 D_{t_1},\; b_2 E_{12},\; b_3 A_{t_1})$$


where achievement at time 2 ($A_{t_2}$) is a function of cultural deficit in background
conditions at time 1 ($D_{t_1}$), a set of enabling conditions current between times 1 and 2
($E_{12}$) and achievement at time 1 ($A_{t_1}$).

The simplest form of this model is linear-additive. Nonlinearity of particular 
measurement scales was dismissed as a substantive problem, as standardised
regression coefficients were to be used in interpreting the relationships (cf. Boyle,
1970). An additive form of the model was chosen for two reasons: (a) so as to give 
results which could be directly compared to those found in the earlier study by 
Marjoribanks (1975), and (b) so as to yield straightforward results of the most
parsimonious interpretation of the model. To have chosen a multiplicative form of the
model would have introduced problems similar to those found in interpreting
commonality analysis when several variables are involved (cf. Wold & Noonan, 1976).
The above form of the model implies that cognitive performance is an additive
function of initial cultural conditions prevailing in the child’s home and that the
influence of these conditions is mediated via enabling conditions in their effects on
cognitive achievement. However, as the constructs representing the enabling conditions
each have residuals attached, the initial conditions are not considered to determine
them completely. What these residuals represent is left unanalyzed within the model, 
although they may be thought of as representing both measurement error and 
ignorance of the full determinants of a particular construct. 
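
A linear-additive model of this kind, interpreted through standardised regression coefficients, can be sketched in a few lines of Python. The example below is a hypothetical illustration only: the data are simulated, the effect sizes are invented, and the names `standardized_betas`, `deficit`, `enabling`, `ach1` and `ach2` are assumptions standing in for cultural deficit at time 1, enabling conditions, and achievement at times 1 and 2. It is not an estimate of the model reported in this study.

```python
import numpy as np

def standardized_betas(X: np.ndarray, y: np.ndarray):
    """Standardised regression coefficients and R^2 for y on the columns of X."""
    Xz = (X - X.mean(axis=0)) / X.std(axis=0)
    yz = (y - y.mean()) / y.std()
    # After centring, the intercept is zero, so none is fitted.
    beta, *_ = np.linalg.lstsq(Xz, yz, rcond=None)
    r2 = 1.0 - np.sum((yz - Xz @ beta) ** 2) / np.sum(yz ** 2)
    return beta, r2

# Hypothetical data with invented effects; each construct carries a residual.
rng = np.random.default_rng(42)
n = 500
deficit = rng.normal(size=n)
ach1 = -0.3 * deficit + rng.normal(size=n)
enabling = -0.2 * deficit + 0.3 * ach1 + rng.normal(size=n)
ach2 = -0.1 * deficit + 0.2 * enabling + 0.6 * ach1 + rng.normal(size=n)

betas, r2 = standardized_betas(np.column_stack([deficit, enabling, ach1]), ach2)
for name, b in zip(["deficit", "enabling", "ach1"], betas):
    print(f"{name:9s} beta = {b:+.2f}")
print(f"R^2 = {r2:.2f}")
```

Because every variable is rescaled to unit variance, the coefficients can be compared across predictors measured on different scales, which is the property the text relies on.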


Data 


The data employed were those collected by the Australian Council for Educational 
Research (ACER) as a participant organization in a survey conducted by the 
International Association for the Evaluation of Educational Achievement (IEA). The 
data collected (during 1970) in Australia covered only one of the six subject areas — 
science — for two school populations, known as population 2: all students aged 14.0 to 
14.11 years at the time of testing (this was the last point in most of the national 
school systems surveyed where 100% of an age group were still in compulsory 
schooling), and population 4: all students who were in the terminal year of those 
full-time secondary education programmes which were either pre-university 
programmes or programmes of the same length. Two years after this initial survey 
ACER conducted a follow-up survey of participants in population 2. Separate 
questionnaires were developed for those who were still at school and those who had 
left school. 


The data employed in the present study were drawn from the population 2 data 
and its associated follow-up survey of those students still in full-time schooling at that 
time. 


The actual sample used in the analysis comprised a random selection from 
students for whom data were available for all variables in the model (i.e., listwise
deletion of cases was employed in estimation of model parameters). An analysis of the 
representativeness of the data was presented by Rosier and Williams (1973). With 
respect to the marker variables (distribution of students in the grades of the defined 
target population, sex of students, and distribution of students in government and 
nongovernment schools) employed in the initial (1970) sample, Rosier and Williams 
stated, “In general there is a fair agreement between the population and sample 
values” (1973, p. 31). Due to the method of backward selection used to obtain the 
sample used here, it was predicted, on the hypothesis that the lower achievers were 
more likely to have left school, that the sample was biased upward with respect to 
achievement. This hypothesis was supported at the ten per cent probability level; 
however, there was a relatively small increase in the average scores of the sample 
employed, compared to that including the school leavers as well. The difference in the 
mean performance scores for the two samples was five per cent of the standard 
deviation of the distribution. Caution thus suggests that it is not possible to generalize 
the results of this study to the students who had left school before the age of 16 years. 


Construction of Variables 

The formation of particular constructs was guided by the need to adapt the 
“modelling aspirations” of the overall model to the state of theoretical development of 
each construct and the particular information available. A further consideration was 
that of restricting the number of variables to the number of constructs in the model. 
This restriction was considered necessary for the model to yield reasonably 
straightforward interpretative results. A proliferation of variables within constructs 
would have led to a much more detailed model, but one which would have contained
greater ambiguity in interpretation.

The method employed in building the construct variables was to take principal
component weights or factor scores after varimax rotation within a canonical factor
model (Allan, 1974).

In those cases where a factor analysis did not reduce the construct to a single
major factor, the decision of whether to use principal components or rotated factors
was guided by three criteria: (1) where the concept of the construct was ambiguous in
its causal linkage with other variables, the first principal component was used as a
global index of the concept, (2) where a construct consisted of two or more clearly
defined factors and one factor was clearly more indicative of the concept in question,
the factor scores for that factor were employed in constructing the variable, and (3)
where a subset of the indicators contributed to all factors in a construct which was
relatively unambiguous in its causal linkages, this subset of indicators was refactored
to produce second-order factors. In the two cases where these criteria were applied,
the second-order analysis produced just the one factor.¹
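
The use of a first principal component as a global index of a construct can be illustrated with the Python sketch below. It covers only the first of the three criteria (no varimax rotation or refactoring is attempted), the three indicators are simulated, and the function name `first_principal_component` is hypothetical.

```python
import numpy as np

def first_principal_component(indicators: np.ndarray):
    """Scores, weights and explained share for the first principal component."""
    z = (indicators - indicators.mean(axis=0)) / indicators.std(axis=0)
    corr = np.corrcoef(z, rowvar=False)
    eigvals, eigvecs = np.linalg.eigh(corr)   # eigenvalues in ascending order
    weights = eigvecs[:, -1]                  # eigenvector of the largest eigenvalue
    if weights.sum() < 0:                     # the sign is arbitrary; fix it
        weights = -weights
    explained = eigvals[-1] / eigvals.sum()   # share of total variance
    return z @ weights, weights, explained

# Hypothetical construct: three indicators sharing one common source of variation.
rng = np.random.default_rng(7)
common = rng.normal(size=(300, 1))
indicators = common + 0.8 * rng.normal(size=(300, 3))

scores, weights, explained = first_principal_component(indicators)
print("weights:", np.round(weights, 2))
print(f"share of variance explained: {explained:.2f}")
```

Each student's score on the component is a weighted sum of the standardised indicators, so a single composite variable replaces several correlated measures.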


Table 1 lists the constructed variables of the model according to the method by 
which they were derived. 


[Figure 1: path diagram not reproduced. Blocks of variables, in causal order from left to right:
Initial Cultural Conditions: Social Background (X1 Parent’s education, X2 Socioeconomic status); Sex of Student (X3 Sex of student); Home Learning Environment (X4 Family configuration, X5 Educational environment, X6 Migrancy).
Enabling Conditions at Time 1: Expectations and Attitudes (X7 Student’s occupational expectations, X8 Student’s educational expectations, X9 Student’s attitude to school).
Achievement at Time 1: X10 Academic achievement.
Enabling Conditions at Time 2: Expectations and Attitudes (X11 Parent’s occupational aspirations for student, X12 Parent’s educational expectations for student, X13 Parent’s interest in student’s education, X14 Student’s occupational aspirations, X15 Student’s educational expectations, X16 Student’s attitude to school).
Achievement at Time 2: X17 Academic achievement.]


Figure 1. Causal ordering of variables in the model. (The causal ordering moves from left to right. All
variables carry a residual [not shown].)
TABLE 1


DESCRIPTION OF VARIABLES


                                                                                  Method of
Name  Description                                                                 Constructionᵃ
X1    Parent’s education                                                          [illegible]
X2    Socio-economic status                                                       [illegible]
X3    Student’s sex                                                               S.I.
X4    Family configuration: family size and sibling position                      P.C.
X5    Educational environment of the home                                         P.C.
X6    Migrancy: language spoken in home and length of residence in Australia      R.F.
X7    Student’s expectation of future occupation at age 14                        S.I.
X8    Student’s expectation of years secondary schooling at age 14                S.I.
X9    Student’s attitude to school at age 14                                      P.C.
X10   Student’s cognitive performance (science score) at age 14                   S.I.
X11   Parent’s occupational aspirations for student at age 16                     FAC
X12   Parent’s expectation of student’s future education at age 16                R.F.
X13   Parent’s interest in student’s schooling at age 16                          [illegible]
X14   Student’s occupational aspirations at age 16                                P.C.
X15   Student’s expectation of further education at age 16                        P.C.
X16   Student’s attitude to school at age 16                                      [illegible]
X17   Student’s cognitive performance (science score) at age 16                   S.I.


ᵃ P.C. - principal component
  S.I. - single indicator
  FAC  - varimax rotated factor
  R.F. - refactored factor


Estimation of the Model

Structural models such as that in focus in this research can be assessed in two 
different respects: (1) internal validity — the adequacy with which a proposed structure 
is able to describe the total set of relations among a set of observed variables while 
using a more parsimonious set of relations, and (2) external validity — the adequacy 
with which a proposed structure is able to predict theoretically endogenous variance. 

As the focus of this research is to investigate the influence of particular variables 
on achievement and the macro process by which these influences are mediated within 
a presupposed theoretical framework, it is external validity which is the main concern 
here. External validity “is solely concerned with the ability of the structure as a source 
of prediction, in comparison to other structures which specify different causal 
structures” (Burt, 1973, p. 145). This approach assumes that the model is internally 
valid, i.e., that it is correctly specified. Although it would be desirable to test the
internal validity of the models empirically, the lack of suitable test statistics for large
models means that such analysis on the models employed here is not feasible. A
further obstacle to the investigation of internal validity of large models is the necessity
of including nearly all causal paths (even those which have no substantive
interpretation) in the model, if it is not to be rejected as invalid. This is mainly a
consequence of the relatively greater influence of measurement error and complexity 
of causation, as model size increases. The analysis of large models, however, is usually 
undertaken in order to investigate the influence of certain key variables in an 
expanded theoretical framework, i.e., the scientific tenet of parsimony within a wider 
subject area is the objective of the research. The approach taken in this study is 
broadly in line with this latter objective. Regression estimates of the coefficients have 
been made for each equation separately. The coefficient of determination can be 
employed as a guide to the specification error contained in any particular equation.
No formal attempt has been made to take account of measurement error in the
variables. Differences in subgroup means have been accounted for by adding the
relevant control variable to the model. Student sex was the only case in which such
differences were considered potentially significant.


The problems raised by differential repetitiveness and unequal redundancy 
between subsets of variables within an equation have been the focus of the methods 
employed in the construction of the composite variables and specification of the 
model. In this process, the aim was to derive variables which could be employed as 
measures of clearly defined conceptual linkages in the model. A degree of redundancy 
may however be found in the use of the variables employed as measures of parental 
ability and socioeconomic status. Although these are conceptually distinct, the 
variables used to operationalize these concepts are not so clearly differentiated. To
overcome this problem, only the variable of stronger influence is included in an
equation; whether the alternative variable still has a significant coefficient once
this has been done is also reported.


Results and Discussion²


The zero-order correlations presented in Table 2 indicate a set of low to moderate 
relationships among the variables of the model. Regression coefficients are presented 
in Table 3. In general the predictive power of the equations is rather low. Prediction 
of achievement at time 2 was influenced most by previous achievement; however, the 
model was not very successful (R² = .19) in predicting this earlier performance.
Enabling conditions do not show a major influence on prediction of achievement, 
although students’ expected education had a consistent influence in both cases. 


TABLE 2


CORRELATIONSᵃ


       X1   X2   X3   X4   X5   X6   X7   X8   X9  X10  X11  X12  X13  X14          X15  X16


X16    03   09   16   12   17  -04   32   31   48   18   31   40   06   27           42
X17    15   08  -22   04   02  -02   34   36   07   75   21   11  -08  [illegible]   28  -09


ᵃ All correlations multiplied by 100


The measures of social background and home environment were generally much 
less powerful predictors of both enabling conditions and cognitive performance than 
had been expected. Coleman (1975) argued that home environment and social
background variables have a differential effect upon achievement according to the 
measure of achievement employed. Cognitive achievement as measured by 
performance in subject areas taught principally in the school (e.g., science, foreign
languages) is influenced less by these variables than that which is more dependent on
learning in the home as well as the school (e.g., arithmetic and reading). The measure 
of cognitive achievement employed in this study was performance in science; the 
results found are similar in this respect to those reported by Marjoribanks (1975) who 
employed a measure of cognitive performance based on reading tests at time 1, and 
reading, arithmetic, and nonverbal tests at time 2. 


The variable for sex of the student yielded some particularly interesting results. It 
had been expected that a general sex bias against girls would show up in most of the 
intervening variables in the model. This was found to be the case in the prediction of 
achievement at both times 1 and 2 and for parents’ expressed interest in the student’s 
education at time 2. (Note: The data did not include measurements on this variable at 
time 1.) This finding of a generally superior performance by boys, compared to girls, 
confirms the result reported by Keeves (1973). Although the data can themselves shed 
no direct light on the underlying reasons for this, it seems reasonable to suggest that 
it may be an indirect result of the sexual socialization which appears to dominate life 
in Australian society. Kelly’s conclusion, based on a review of the literature related to 
the differential participation of the sexes in science and science education, that “At 
school and at work science has a masculine image, and is often rejected as unsuitable 
for girls; boys are encouraged from early childhood to take an interest in mechanical 
and scientific things, but girls are not” (1976, p. 1103), lends strong support to this 
argument. Further support is provided by the finding that parents expressed more 
interest in the progress of the educational careers of boys than girls. 


The fact that there was no significant difference in expected education between the 
sexes at time 1, but that girls had a more positive attitude toward school suggests, 
however, that girls were managing to overcome much of the general negative effects of 
this socialization on educational expectations. Also, there was no significant difference 
between boys and girls in their occupational expectations at time 1, but girls were 
showing higher aspirations than boys at time 2. Further, girls had a more positive 
attitude to school at time 2 than did boys. Although these findings were not initially 
expected, they may be due to differences in the way occupational aspirations develop 
for boys and girls. If the occupational aspirations held by boys develop later than 
those of girls, this result may be due to an artifact of the research design. However, it 
may reflect a different pattern of their development between the sexes. One such 
pattern which would be consistent with the findings is based on the argument that as 
occupational aspirations are viewed as being more important to male futures than 
those of females, they are more dependent on “hard” data such as those evidenced by 
progress at school. In this case we may observe a suppressor variable effect. Girls’
aspirations may not be significantly influenced by school progress, while those of boys 
are. Further, it may be posited that positive signs of progress for boys merely confirm 
self and parental aspirations, i.e., do not raise them, but that signs of no, or slower 
than expected, progress depress the level of these aspirations. The more positive 
attitude of girls to school could then be explained by the different meaning of 
schooling experience for girls compared to boys. An hypothesis based on the above 
argument is indirectly supported by the results for the equation predicting students’ 
attitudes to school at time 2, after controlling for student sex (assuming again that
sex is acting as a suppressor variable). 
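
The suppressor variable effect invoked above can be illustrated with a small numerical
sketch. The data below are invented for illustration and are not drawn from the study;
the names signal, noise, x1, and x2 are hypothetical. The predictor x1 mixes a component
related to the outcome with an unrelated component, while x2 measures only the unrelated
component. Although x2 has no simple relationship with the outcome, controlling for it
raises the coefficient of x1.

```python
# Illustrative, made-up data (not taken from the study). All variables have mean zero.
signal = [1.0, -1.0, 1.0, -1.0]   # component related to the outcome
noise = [1.0, 1.0, -1.0, -1.0]    # component unrelated to the outcome

y = signal[:]                                 # outcome
x1 = [s + n for s, n in zip(signal, noise)]   # predictor contaminated by the noise
x2 = noise[:]                                 # suppressor: measures only the noise


def dot(a, b):
    """Sum of cross-products of two equal-length sequences."""
    return sum(p * q for p, q in zip(a, b))


def simple_slope(x, y):
    """Least-squares slope of y on one mean-zero predictor."""
    return dot(x, y) / dot(x, x)


def two_predictor_slopes(x1, x2, y):
    """Least-squares slopes of y on two mean-zero predictors (normal equations)."""
    s11, s12, s22 = dot(x1, x1), dot(x1, x2), dot(x2, x2)
    s1y, s2y = dot(x1, y), dot(x2, y)
    det = s11 * s22 - s12 * s12
    b1 = (s22 * s1y - s12 * s2y) / det
    b2 = (s11 * s2y - s12 * s1y) / det
    return b1, b2


slope_x1_alone = simple_slope(x1, y)
slope_x2_alone = simple_slope(x2, y)
b1, b2 = two_predictor_slopes(x1, x2, y)
print(slope_x1_alone, slope_x2_alone, b1, b2)
```

With these values the slope of y on x1 alone is 0.5, and it rises to 1.0 once x2 is
controlled; x2 itself receives a coefficient of -1.0 even though its simple slope is zero.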

Family configuration was found to have a moderate influence on occupational 
aspirations held by the student at time 1. A student from a large family who had older 
brothers and sisters was likely to have lower occupational aspirations. Children from 
families who had recently arrived in Australia or whose country of origin was not an 
advanced industrial nation, had slightly lower educational and occupational aspirations 
but these migrancy factors did not influence the child’s attitude to school
independently of her/his educational and occupational aspirations. The educational 
environment of the home had a significant positive influence upon the student’s 
attitude to school at time 1, but no independent effect upon cognitive achievement. 
Socioeconomic status (measured by employment of family head) had only a small 
influence on any of the variables at time 1; however, parental education was found to 
have a moderate influence on achievement at time 1. Both socioeconomic status and 
parental education had no significant influence in predicting variables at stage 2 once 
the enabling conditions and achievement at time 1 had been controlled for, with one 
exception. The exception was in predicting parents’ occupational aspirations for their
children, where parents’ education had a moderate positive influence. Family 
configuration also exerted an influence on parents’ occupational aspirations for their 
children, the parents of larger families having lower occupational aspirations. 


Generally, the results can be interpreted as supporting a model in which social 
background and home environment influence early aspirations and cognitive 
achievement. These aspirations and achievements then mediate this influence on later 
achievement and aspirations. The findings support a stereotype of the achieving child 
as coming from a home where both parents and the child have high educational and 
occupational expectations, the family is small, and the child has done well 
academically in earlier schooling. Because of social attitudes about the role of women 
in our society, boys perform better than girls on this measure of cognitive
achievement. The model is relatively unsuccessful in indicating why some
families provide this type of environment and others do not. The social background 
and family environment influences found in the model give weak support to the 
cultural deficit thesis. 

The relatively low predictive power of many equations in the model implies either 
that there are major omissions with respect to predictor variables, or that 
measurement error is a serious problem in the model. A possible source of
measurement error in the variables measuring aspirations/expectations for education
and occupation is the “frog pond effect.” According to this effect, it is the
strength of aspirations/expectations held by an individual relative to those held by a
particular social group which is of greatest influence. In the present context this
would imply that aspirations/expectations should be measured relative to those held 
by classmates. Some evidence in favour of this effect has been found by Toomey 
(1976). He found that student expectations measured relative to their classmates had 
greater predictive power than expectations measured relative to students across 
schools. Further, McDonnell and Beswick (1977) have produced recent evidence that 
this effect is important for predicting educational intentions among final grade 
students in Australia. It is thus quite possible that a replication of the analysis which 
measured affective attributes of students and their parents within the terms of 
reference group theory may give much higher predictive power to many of the 
equations, and hence, greater external validity to the model. Some initial attempts to
develop a reference group theoretic orientation to the conceptualization of parental
and student affective inputs with respect to social class, school catchment and within
school sources are presented in Cuttance (1979). The main thrust of the argument
presented there is based on the proposition that significant reference groups are
formed within boundaries imposed upon groups of individuals by social, economic and
organizational constraints due to processes of selection in society at large and within
individual schools. 
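
The “frog pond” argument amounts to rescoring each student’s expectations relative to
the classroom rather than on an absolute scale. The following is a minimal sketch of
that rescoring under assumed conditions: the scores, the class labels, and the function
name relative_to_class are hypothetical and do not come from the data analysed here.

```python
from statistics import mean

# Hypothetical expectation scores grouped by classroom (not data from the study).
classes = {
    "class_A": [4.0, 5.0, 6.0],
    "class_B": [1.0, 2.0, 3.0],
}


def relative_to_class(scores):
    """Express each score as a deviation from the mean of the student's own class."""
    class_mean = mean(scores)
    return [score - class_mean for score in scores]


relative = {name: relative_to_class(scores) for name, scores in classes.items()}
print(relative)
```

Under this scoring a student with a raw score of 3.0 in class_B stands as far above
classmates as a student with a raw score of 6.0 in class_A, which is the comparison the
reference group argument treats as the influential one.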

Another source of improvement in the external validity of the model would be to
employ purposely designed direct measures in place of the abstracted (factor) indices
used here. However, as the investigation was based on a secondary analysis of the data
set, it was not possible to follow this “ideal” path of measurement for the concepts
incorporated in the model. Further, an alternative form of analysis based on log-linear
regression would allow the incorporation of interaction effects in the model. Any 
decision regarding the incorporation of such effects is dependent upon one’s 
aspirations with respect to the modelling process and would indicate an aim to 
investigate the form of relationship between the various concepts in the model. The
aim in this study was, however, to consider the relationships at a macro level and to 
replicate the model proposed by Marjoribanks (1975); therefore, interaction effects 
were not included. 


Notes 
1 Further information on the factor scores and selection of indicators for inclusion in each factor is available from
the author. 
2 A short report of the findings discussed here is to appear in a special issue of Studies in Educational Evaluation 
devoted to work based on IEA data.


Equality of Opportunity in American and Canadian
Graduate Education: A Comparison of Gender Differences 


Two aspects of equality of educational opportunity (access and treatment) in
American and Canadian graduate schools are examined in this paper. To assess 
equality of access, data are presented on degrees awarded to women and on 
full-time faculty positions held by women in the two countries. Equality of 
treatment is assessed by examining studies of professional socialization in selected 
graduate schools. The results show that women are underrepresented as both faculty
and graduate students in both countries. Moreover, sex differences in faculty and 
peer group interaction suggest that women in the United States and Canada do 
not receive equality of educational opportunity. 


Education is a highly valued commodity in both American and Canadian societies. 
Ostensibly (given these nations’ adherence to an ideology of opportunity), it is 
available at all levels to both women and men. This paper examines the concept of 
equality of educational opportunity for women and men in American and Canadian 
universities, with specific reference to graduate education.¹


Equality of educational opportunity involves two aspects, namely, equality of 
access, and equality of treatment (Harvey & Lennards, 1973). Both of these notions 
will be considered in this paper, with emphasis on the latter. Equality of access 
implies access to institutions of higher learning, and the receipt of degrees. Equality of 
treatment implies being treated and rewarded in such a way that there is motivation 
to pursue higher education and motivation to finish degrees. It implies access to 
experiences that produce such motivation. 


In order to assess equality of access to higher education, data are presented on 
degrees awarded to women, as well as the proportions of full-time women faculty in 
American and Canadian universities, using national statistics. For an assessment of 
equality of treatment, an examination of interactional patterns of women and men 
students in selected American and Canadian graduate schools is undertaken.² The
theoretical perspective used is a model of professional socialization, based primarily on
symbolic interaction theory. The following section reviews the literature and outlines
the features of the perspective which have particular relevance for this analysis.


University of Calgary. Her teaching and research interests include the sociology of gender relations, stratification, professional
socialization and Quebec Society. Among her publications are studies of various aspects of gender differences in education. She 
is currently undertaking research on managerial women in Canada and France.


Professional Socialization in Graduate School


In the sociology of the professions, a good deal of attention has been devoted to 
the question of professional socialization, particularly in the United States. However, 
until recently, in both the United States and Canada, sex was not considered as an 
independent variable. Studies focused either on the socialization of men into 
masculine³ occupations such as law, medicine, engineering and academe (cf. Becker,
Geer, Hughes & Strauss, 1963; Becker & Carper, 1956a, 1956b; Carper & Becker,
1957-58; Lortie, 1959; Merton, Reader & Kendall, 1957; Warkov & Zelan, 1965 in the
United States, and Clifton, 1976; Reid, 1974 in Canada), or the professional 
socialization of women into feminine occupations such as nursing or teaching (cf. 
Davis & Olesen, 1963, 1965; Olesen & Whittaker, 1968; Simpson, 1967 in the United 
States, and Ondrack, 1974 in Canada). Women entering masculine occupations 
represented such an anomaly to researchers that they were excluded from the 
analysis.⁴ In more recent studies, sex differences tended not to be analyzed, or were
analyzed only superficially.

Paralleling the resurgence of feminism as an influential social movement in the 
West is a heightened awareness of the condition of women. This cognizance is 
reflected in an increasing volume of academic research on women, a good proportion 
of which focuses on women in academe. In the United States, many such studies may 
be cited (cf. Astin, 1969; Bernard, 1964, 1971; David, 1969; Graham, 1978; Hughes, 
1973; LaSorte, 1971; Mitchell & Starr, 1971; Patterson, 1971; Rossi, 1970). The number 
of studies focusing on women in Canadian universities has also mushroomed (cf.
Adam, 1971; Ambert, 1976; Andersen, 1974; Archer, 1973; Day, 1973; Hitchman, 1974; 
Robson & Lapointe, 1971; Symons, 1978a) and has included reports on the status of 
women at various Canadian universities such as the University of Alberta (Report of 
the Committee on the Status of Women Academics at the University of Alberta, 1975), 
the University of British Columbia (Day, 1973), the University of Calgary (Report of 
the Committee on the Status of Women at the University of Calgary, 1977), McGill 
University (Report of the Senate Committee on Discrimination as to Sex in the 
University, 1971), McMaster University (McDonald & Lenglet, 1973), Memorial 
University (Report on Sex Discrimination in Faculty Salaries at Memorial University 
of Newfoundland, 1974), Queen’s University (Report of the Principal’s Committee on 
the Status of Women at Queen’s University, 1974), the University of Toronto (Report 
of the Committee on Employment Conditions of Full Time Faculty, 1974), the 
University of Waterloo (Report of the President’s Advisory Committee on Equal 
Rights for Women and Men, 1973), and York University (Report of the Senate Task Force
on the Status of Women at York, 1975). However, few of these studies have dealt with 
the professional socialization of graduate students per se. 

Some recent American studies of women entering masculine occupations have 
addressed the socialization issue and have analyzed the case of graduate students in 
the university (cf. Acker, 1974; Adler, 1976, 1978; Feldman, 1973, 1974; Holmstrom & 
Holmstrom, 1974; Kjerulff & Blood, 1973; Sells, 1975; Solomon, 1976). Professional 
socialization involves complex interaction patterns, and an analysis of the process 
focuses on interaction in the organizational setting, and the impact of the environment 
on individual perspectives, motivation and behaviour. Considering the case of 
socialization in a predominantly masculine profession, one is led to ask if the process 
is the same for men who are fulfilling cultural prescriptions as it is for women, for 
whom such endeavours represent “non-traditional” occupational choices. 

For the purposes of this analysis, two aspects of the professional socialization
process are examined, namely, interaction with professors (including formal and
informal interaction, the nature or quality of the relationship, and the sponsorship 
system) and contact with peers in the graduate school setting (including academic and 
nonacademic encounters). 


As the many American studies on professional socialization have demonstrated (cf. 
Baird, 1969; Becker & Carper, 1956a; Gottlieb, 1961-62; Rosen & Bates, 1967), both 
formal and informal participation with faculty members is crucial for the professional 
socialization process. By interacting with faculty members, graduate students acquire 
knowledge and skills, as well as the more covert rules of the game of the academic 
profession. Such interaction is also important for the development of a professional 
self-concept. Informal participation is defined as interaction that takes place outside 
of the office or classroom. In this situation, academic questions may or may not be 
discussed, but if they are, such discussion takes place in a more relaxed and informal 
atmosphere. Such activity with professors reinforces professional identity and aids in 
the professional socialization process, since it indicates to those students concerned 
that they are being admitted into “the club.” Moreover, important research leads, tips 
about jobs, etc., may be passed on in such encounters. 


In analyzing faculty-student interaction an attempt is made to gauge the nature or
quality of the relationship. This is a significant area of analysis for a number of 
reasons. Heiss (1967) states, for example, that 


The quality and character of the relationship between the doctoral student and his 
major professor is unequivocally the most sensitive and crucial element in the 
doctoral experience, since it not only influences the graduate student’s scholarly 
development but also has far-reaching after effects. (p. 39) 


Moreover, as Gregg (1972) notes, collegial relationships between students and faculty 
are positively related to student satisfaction with graduate education. 


One final aspect of interaction with professors explored in this paper is the 
sponsorship system, which, as an ideal type, operates in the following manner. A 
student is chosen by a professor, or chooses and is accepted by a professor, learns 
her/his style and method, is perceived as an academic successor, and is later assisted 
in finding her/his first employment. Although this system is more prevalent and more
pronounced in the natural sciences than in the social sciences or the humanities, it is 
nonetheless in operation in these latter fields as well. As Becker and Carper (1956a, 
pp. 297-298) point out, this system involves a set of interlocking responsibilities. The 
sponsor is accountable to her/his colleagues for presenting a certain student for hire 
and the student is in turn responsible to the sponsor for her/his behaviour. The 
protégé is expected to act responsibly as a skilled and dedicated member of the 
profession on a long-term basis. This obligation helps solidify the occupational identity 
of the graduate student. 

Peer group interaction, in terms of both academic and nonacademic encounters, is 
also important for professional socialization. Graduate students may act as a reference 
group for one another, reinforcing group norms, setting standards, and monitoring 
behaviour. Many researchers (Baird, 1969; Becker et al., 1963; Becker & Carper, 1956a; 
Feldman, 1974; Rosen & Bates, 1967; Wright, 1964) note the importance of student 
culture for shaping attitudes and behaviour. 


Methodology 


Two sources of data have been employed in this paper. National statistics on 
degrees awarded to women and the proportion of full-time faculty positions held by
women in Canadian and American universities have been gathered for selected years.
Unfortunately, such comprehensive information on professional socialization in 
graduate schools in the two countries is not available. 


To this author’s knowledge, only one systematic study has been done comparing 
the condition of women to that of men in Canadian graduate schools (Hitchman, 
1976), and it is from this study that the Canadian data on professional socialization 
have been generated. Two Canadian metropolitan universities in Toronto, Ontario, 
were chosen for study, the University of Toronto and York University. Included in the 
sample were all women Ph.D. candidates enrolled full-time and in residence during the 
1974-75 academic session, and enrolled in a Ph.D. programme offered by both 
universities. A corresponding sample of male Ph.D. students, matched in terms of 
discipline, university, and year of study was drawn. The academic disciplines chosen 
for study were categorized as natural science, including biology, zoology, chemistry and 
physics; social science, including political science, psychology, history, sociology, social 
and political thought; and humanities, consisting of English and philosophy. 
Questionnaires were mailed to Ph.D. candidates in the period between November 1974 
and February 1975. Three hundred and fifty-nine (63%) usable questionnaires were 
returned, the final sample consisting of 176 women and 183 men. One hundred and 
twenty-two, or 34% of the sample, were York students and 237 respondents, or 66%, 
were from the University of Toronto. In the spring of 1975, 21 intensive interviews 
were conducted with a subsample of both male and female respondents. 
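
The sample figures reported above can be checked for internal consistency with a few
lines of arithmetic. This is only a sketch: the counts and the 63% return rate are taken
from the text, but the number of questionnaires mailed is not reported, so the figure
computed for it below is an approximation implied by the rounded return rate and should
be read as an assumption.

```python
# Counts reported in the text.
women, men = 176, 183
york, toronto = 122, 237
reported_return_rate = 0.63  # "63%" of questionnaires were usable returns

returned = women + men

# Institutional shares of the final sample, rounded to whole percentages.
york_share = round(100 * york / returned)
toronto_share = round(100 * toronto / returned)

# Approximate number mailed implied by the rounded return rate (not stated in the text).
approx_mailed = round(returned / reported_return_rate)

print(returned, york_share, toronto_share, approx_mailed)
```

The two breakdowns (by sex and by university) sum to the same 359 respondents, and the
institutional shares reproduce the reported 34% and 66%.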


The findings from this study are compared with those derived from various 
American studies, in particular, those of Acker (1974), Feldman (1973, 1974), 
Holmstrom and Holmstrom (1974), Kjerulff and Blood (1973), and Sells (1975). 
Feldman’s, Holmstrom and Holmstrom’s and the major part of Sells’ analyses were 
based on data derived from a national sample of faculty and students sponsored by 
the Carnegie Commission on Higher Education in conjunction with the American
Council on Education. Feldman’s sample consisted of 32,963 students enrolled in 
graduate programmes (MA and Ph.D.) and graduate professional school programmes 
in 158 American colleges and universities. Holmstrom and Holmstrom excluded 
professional degree and master’s students, selected a random sample of every third 
man (3,401 cases), and included the total group of women doctoral students (2,815). 
Sells used a subsample of the Carnegie sample, including only those institutions 
designated as “high quality universities.” Kjerulff and Blood sampled 41 males and 27 
females in one graduate Department of Psychology in an American university. Acker’s 
study was based on a sample stratified by sex and field of study, of 975 
American-born graduate and professional students (MA, Ph.D., and first professional 
degree) in science, social science, humanities, medicine, law, education, and social 
work, in one large midwestern university in the United States. 

The author recognizes the limitations of the data base. Ideally, a national sample 
of both Canadian and American graduate schools would provide the appropriate data 
for comparing equality of treatment. Of the studies utilized in this analysis, those of 
Feldman, Holmstrom and Holmstrom, and Sells provide such a sample, although 
restricted to the United States. One does not know if the other studies are 
representative of graduate schools in that country. The Canadian data cannot claim to 
be representative, restricted as they are to two graduate schools in one province. 
However, the schools chosen for analysis are not uninteresting in their own right. Both 
are situated in one of Canada’s largest metropolitan centres, and the University of 
Toronto is recognized as one of the most prestigious schools in Canada. York 
University, on the other hand, has some interesting characteristics. As an example of 
the trend in Canadian university expansion in the 1960’s, it represents an attempt to 
develop a quality graduate school in a very short space of time.

While the data on equality of access are representative, the uncertainty of the
representativeness of the data on equality of treatment restricts their generalizability 
to all American or Canadian graduate schools. However, the comparisons made in this 
paper are worthy of note, and provide a first step in analyzing similarities and 
differences in equality of opportunity in graduate education in American and 
Canadian universities. 


Results 
Equality of Access 
Equality of access to higher education for women and men is measured in terms 
of degrees granted, and faculty positions held. In the United States as well as in 
Canada, women are underrepresented as both faculty and students. Table 1 presents 
data on participation of women in Canadian and American universities.


TABLE 1


PROPORTION OF DEGREES AWARDED TO WOMEN IN CANADIAN
AND AMERICAN UNIVERSITIES, AND PROPORTION OF WOMEN
AS FULL-TIME FACULTY FOR SELECTED YEARS


                   Bachelor’s and First                                                          Full-time
                   Professional Degree       Master’s                  Doctorate                 Women Facultyᵃ
Year               Canada       U.S.         Canada       U.S.         Canada       U.S.         Canada       U.S.
                   %            %            %            %            %            %            %            %

1919-20/1920-21ᵇ   18.3         34.2         22.0         30.2         4.2          15.1         NA           26.3
1929-30/1930-31    25.3         [illegible]  21.4         40.4         [illegible]  15.4         NA           [illegible]
1939-40/1940-41    24.1         41.3         10.5         38.2         6.7          13.0         NA           27.6
1949-50/1950-51ᵇ   20.3         23.9         13.9         29.2         5.4          9.6          NA           24.5
1959-60/1960-61ᵇ   25.7         35.3         19.0         31.6         8.5          10.5         [illegible]  22.0
1969-70            38.3         41.5         [illegible]  [illegible]  9.3          13.3         13.1         25.0
1970-71            38.0         43.4         22.0         40.1         9.3          14.3         12.8         23.6ᶜ
1971-72            39.4         43.6         24.8         40.6         9.3          15.8         13.0         [illegible]
1972-73            39.8         44.9         26.8         41.4         11.2         17.9         13.0         21.4
1973-74            41.5         44.2         [illegible]  43.0         12.3         [illegible]  13.0         22.6ᶜ
1974-75            44.4         45.3         [illegible]  44.8         16.1         21.3         13.9         [illegible]
1975-76            46.3         45.5         30.5         46.4         18.8         22.9         14.1         24.3


Sources: Dominion Bureau of Statistics. Survey of higher education Part 2. Degrees, staff, and summary. 1964.
Statistics Canada. Education in Canada. A statistical review. Catalogue Number 81-229; 1973, 1974, 1975, 1976, 1977.
U.S. Department of Health, Education and Welfare. Digest of education statistics. Catalogue number 74-7662. 1974, 1975, 1976, 1977-78.

ᵃ Cross-national data on women faculty are not strictly comparable, since included in the American data are institutions
that would not be classified as universities in Canada.
ᵇ American data are for the first year, and Canadian data are for the second year listed.
ᶜ Estimated.


A number of trends are worthy of note. For both countries the attrition rate of women increases as
one moves up the academic ladder. In 1975-76, for example, women in Canada earned 
46.3% of all bachelor’s and first professional degrees, 30.5% of master’s degrees, and
18.8% of Ph.D.’s. During that same year women made up 14.1% of all full-time
faculty members in Canadian universities. The corresponding data for American 
schools are 45.5% of first degrees, 46.4% of master’s, 22.9% of doctorates, and 24.3% 
of the faculty. This attrition rate is consistent over most of the years listed, both in 
Canada and the United States. 

A second observation of interest concerns change over time. The greatest increase 
occurs in terms of first degrees. From 1920 to 1975 the proportion of women receiving 
bachelor’s and first professional degrees increased by 28 percentage points in Canada 
and by 11.3 percentage points in the United States. Master’s degrees awarded to 
women have increased by 8.5 points in Canada, and 16.2 points in the United States. 
The increase in doctoral degrees has not been steady in either the United States or 
Canada. Indeed it was not until 1975 that Canada surpassed the proportion of Ph.D.’s
that were awarded to women in 1930! A similar phenomenon occurred in the United
States. As for women faculty, there has been a very slight increase in Canada (2.4%
from 1960-1970), while the United States has experienced a decline from 1920.⁵
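
The percentage-point changes cited in this paragraph follow directly from the first and
last rows of Table 1. The short sketch below reproduces them; the dictionary layout and
variable names are illustrative only, while the values are those printed in the table
(American figures for 1919-20, Canadian figures for 1920-21, and both for 1975-76).

```python
# (first listed year, 1975-76) shares of degrees awarded to women, in percent, from Table 1.
shares = {
    ("bachelor's and first professional", "Canada"): (18.3, 46.3),
    ("bachelor's and first professional", "U.S."): (34.2, 45.5),
    ("master's", "Canada"): (22.0, 30.5),
    ("master's", "U.S."): (30.2, 46.4),
}

# Change in percentage points, rounded to one decimal to avoid floating-point noise.
changes = {key: round(last - first, 1) for key, (first, last) in shares.items()}

for (degree, country), change in changes.items():
    print(f"{degree}, {country}: +{change} percentage points")
```

The results match the figures given in the text: 28 and 11.3 points for first degrees,
and 8.5 and 16.2 points for master’s degrees, in Canada and the United States
respectively.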

Finally, a comparison of the American and Canadian data demonstrates that 
American women have fared better than their Canadian counterparts in terms of 
receipt of university certification. For every year and degree level (save B.A.’s in 
1975-76), women in American schools received proportionately more degrees than 
women in Canada. As noted in Table 1 the data on women faculty are less comparable 
cross-nationally than are those dealing with degrees. However, it does appear that 
American women have more access to academic careers than do Canadian women. 


This sketch does not tell the whole story, however, for here we are concentrating 
on only one aspect of the concept of equality of educational opportunity. We now turn 
to the analysis of the second aspect, namely, equality of treatment. 


Equality of Treatment 

By way of background, a comparison is presented of some demographic
characteristics of the graduate students enrolled in the American and Canadian 
schools in the sample. The students in the Canadian sample average 28.6 years of age 
for women, and 27.2 years for men. These results are similar to those found for 
American schools, where in Kjerulff and Blood’s (1973) sample, men averaged 25.4 
years of age, and women, 26.8 years, and in Acker’s (1974) sample, where 35% of men 
and 29% of women were between 25 and 29 years of age, with 47% of the men and 
51% of the women being under 25. However, it does appear that Canadian graduate 
students are slightly older than their American counterparts. This could be a function 
of the location of the Canadian schools under study. In Ontario, five years of high 
school are required to obtain entrance into university. Moreover, since Canadian 
schools tend to be commuter institutions, it is reasonable to assume that a good 
proportion of the students took their high school training in Ontario. 


Some noteworthy differences between American and Canadian schools are seen in 
terms of the citizenship status of the students. In the Canadian sample, 46% of men 
and 49% of women were born in Canada. Thirty-one percent of the men and 26% of 
the women are American. (In terms of citizenship status, more women (65%) than men
(55%) hold Canadian citizenship.) Comparison of these results to those of Feldman 
(1974) shows that only 6% of the graduates in those American schools are not United 
States citizens, as compared with 40% of the Canadian sample who are not Canadian
citizens.


Considering religious preference, over one-half of both Canadian women (55%)
and men (61%) have no stated preferred religion. These figures are considerably 
higher than those reported in American studies. Acker (1974) found that 25% of 
women and 27% of men in her sample reported their religion as agnostic or atheist, 
while Feldman states that 24% of the men and 19% of the women listed their current 
religion as “none.” However, the differences could perhaps be caused by variation in 
the wording of the questions.⁷ In both American and Canadian schools, the most 
common religion among men and women is Protestantism, followed by Catholicism 
and then Judaism. 


The socioeconomic status of graduate students in American and Canadian schools 
is similar, that is, essentially white-collar, professional middle class. In the Canadian 
sample, 75% of women and 67% of men have fathers engaged in white-collar 
occupations, with 53% of the fathers of women and 48% of the fathers of men 
engaged in professional or managerial occupations. (Twenty-two percent of the women 
and 16% of the men have mothers who are professionals or managers.) These data are 
similar to those of Acker, who found that 77% of women and 76% of men come from 
white-collar homes, while in Feldman’s sample, 64% of women and 62% of men do so. 
The somewhat lower SES of students in Feldman’s sample could reflect the fact that 
colleges, as well as universities, are included here. 


Studies of female academics in the United States (Astin, 1969; Patterson, 1971; 
Rossi, 1965) have shown that women faculty are less often married and have fewer 
children than their male counterparts. Additionally, women in academe have lower 
marriage rates and fewer children than women in the general population (Astin, 1969). 
This leads us to suspect that women graduate students should show a similar pattern. 
As Feldman (1974) points out, because marriage and children may be impediments to 
higher education for women, we expect to find significant differences in marital and 
parental status of women and men graduate students. In the Canadian sample, 
although there are no significant differences in the marital status of the sexes, the 
direction of the relationship indicates that more women (43%) than men (40%) are 
single. Fifty-three percent of the men are married as compared to 46% of the women. 
These data are similar to Acker’s (1974), where 55% of the men and 43% of the 
women in that sample are married. Feldman found somewhat higher marriage rates, 
with 69% of the men and 56% of the women being married. 


In the Canadian sample, more women than men have children, but the differences 
are not statistically significant. Seventy-four percent of men and 67% of women who 
are married, separated, divorced or widowed, are childless. Acker’s data show that 
women are 8% less likely to have children than their male counterparts. A decade or 
more ago, more women academics were single and childless than their male 
counterparts. Trends appear to be changing, and although a small difference is still 
found in terms of marital status, with more women being single, the married women in 
the Canadian sample have slightly more children than married men. These data 
suggest a modification in the role of women. In the past, women were forced to choose 
between the career role and the wife-mother role. This no longer appears to be the 
case, and young women are combining both roles to a greater extent than before. 


One last item of interest in the comparison of men and women graduate students 
in the United States and Canada relates to the question of grades. In both American 
and Canadian schools, women tend to receive better grades than men. Feldman (1974, 
p. 18) analyzed undergraduate grade-point averages, finding that GPA’s of B+ or 
better were achieved by 37% of the men, as compared to 52% of the women. In the 
Canadian sample, 85% of the women and 74% of the men had a B+ or better average 
in their first degree. At the master’s level, grades for men and women are similar, with 
75% of each group receiving an A- or better. At the Ph.D. level, women again do 
better than men, although the difference is not significant at the .05 level. Eighty-two 
percent of women and 76% of men achieved an A- or better on completed course 
work. Women’s higher grades could perhaps be a function of the greater self-selection 
into graduate school among women, which would lead to more qualified women 
students (cf. Adler, 1976). 


Interaction with Faculty 


How do male and female graduate students in the United States and Canada fare 
in terms of interaction with faculty? Kjerulff and Blood (1973) report that in one 
American school, female graduate students have fewer discussions with their research 
advisors than do males. In terms of encounters in formal settings, the results from two 
Canadian schools show statistically significant, although not large, sex differences. In a 
given two-month period, men averaged 3.13 meetings with their advisors in the office 
setting, and women averaged 2.29 (p<.05). 


The Canadian findings support those of Holmstrom and Holmstrom (1974) and 
Kjerulff and Blood (1973), which report that women in American graduate schools 
interact less with professors on an informal basis than do men. Although there are no 
significant differences in the number of discussions men and women have with their 
advisor outside the office setting (X̄ is 1.37 for men and 1.38 for women, in a given 
two-month period), women are less likely than men to meet informally (i.e., for coffee 
breaks, meals, drinks, parties, etc.) either on or off campus with professors in their 
department. Almost twice as many men (27%) as women (14%) meet with professors 
informally once a week or more, while 41% of men and 30% of women have such 
contact at least twice a month. Moreover, in the Canadian sample, 14% of women 
state that they have no informal contact with professors at all, whereas only 6% of the 
men do so (p<.05). 

Comparing men’s and women’s perceptions of their relationship with advisors, 
Kjerulff and Blood (1973) found that women view this relationship as less relaxed, 
jocular, egalitarian, casual or friendly than do men. Similar results are found in the 
Canadian sample. The question of egalitarianism, or collegiality, requires further 
exploration. The Canadian data indicate that for women, relationships with advisors 
are not necessarily egalitarian ones. In fact, men are more likely than women to 
describe the relationship as egalitarian. Fifty-seven percent of the men as compared to 
only 46% of the women state that “egalitarian” describes the relationship they have 
with their advisor well or very well (p<.05; γ=.11). Moreover, men (31%) are 
somewhat more likely than women (22%) to state that their advisor regards them as a 
colleague. These data are consistent with those of Sells (1975, p. 11) which show 
significant sex differences in terms of student perceptions of faculty’s view of them. 
More men than women in her sample report that their primary faculty contact regards 
them as a colleague. In our sample as well, more men (34%) than women (25%) state 
that the professor with whom they have the most academic contact regards them as a 
colleague. 

Furthermore, women are also significantly less likely to have discussions with their 
advisors concerning their advisors’ research. Thirty-four percent of the women and 
43% of the men in the Canadian sample had at least one such discussion during a 
given two-month period (p <.05). Discussions with one’s advisor concerning the 
latter’s research, as Kjerulff and Blood (1973, p. 630) suggest, can be considered a 
measure of the egalitarian nature of the relationship between student and advisor. 
These data suggest that women graduate students do not enjoy the same types of 
relationships with their advisors as do their male counterparts in either American or 
Canadian universities. 


The importance of sponsorship in the professional socialization process in 
American schools is suggested by Acker (1974, p. 18). She found that for both men 
and women, the level of scholarly ambition (i.e., intention to publish in a professional 
journal) is positively correlated with enjoying sponsorship. Respondents in the 
Canadian study were asked to indicate the number of sponsors, both male and female, 
that they have. A sponsor was defined as “someone who has made efforts to give you 
career or research advice, direct your programme, employ you or secure jobs for you, 
secure financial aid, privileges, etc., for you.” The Canadian data do not support the 
contention based on American studies that females are excluded from this system. 
Both men and women average one sponsor each, although a surprisingly high 
proportion of both sexes (43% of men; 42% of women) are without sponsorship of any 
kind. These proportions are slightly higher than those reported by Acker (1974, p. 17) 
who found that 32% of the men and 34% of the women were without sponsorship. 
Sells (1975, p. 12) notes that 64% of males and 50% of females in her study report 
having had a sponsor⁸ (p<.05). 


For the Canadian sample, some sex-status homophily⁹ is evidenced in terms of 
sponsorship. Understandably enough (since most potential sponsors are men), there 
are no sex differences in terms of male sponsorship. Exactly one-half of the men and 
women have at least one male sponsor. However, for female sponsors, the situation is 
different. Of the total sample, 19% of women and 10% of men have female sponsors 
(χ² = 5.01; p<.05; γ = 0.35). Looking at this question somewhat differently, 
significantly more women than men are among those graduate students who have 
female sponsors. Of this group, 64% are women and 36% are men (p <.05). 
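The association coefficient printed beside the chi-square values appears to be Goodman and Kruskal's gamma, which for a 2 x 2 table reduces to Yule's Q. That reading is an assumption, not something the text states. As a rough check, the sketch below computes Q from the two rounded percentages given above (19% of women and 10% of men with a female sponsor). The function name `yules_q` is illustrative only.

```python
def yules_q(p_group1: float, p_group2: float) -> float:
    """Yule's Q for a 2x2 table built from two group proportions.

    Q = (ad - bc) / (ad + bc), where a, b are the shares of group 1
    with and without the trait, and c, d the same for group 2.
    Q does not depend on the group sizes, so proportions suffice.
    """
    a, b = p_group1, 1.0 - p_group1
    c, d = p_group2, 1.0 - p_group2
    concordant = a * d
    discordant = b * c
    return (concordant - discordant) / (concordant + discordant)


# Assumed inputs: the rounded percentages reported in the text.
q = yules_q(0.19, 0.10)
print(round(q, 2))  # 0.36
```

With the rounded percentages the result is about .36, close to the reported .35; the small gap is what one would expect from rounding in the published figures.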


An important variable was controlled in the Canadian study which appears 
relevant to sponsorship, namely, programme of study. As expected, this variable 
influences the number of sponsors the respondents have. Graduate students in the 
humanities enjoy less sponsorship than do those in the natural sciences or the social 
sciences. Indeed, only 47% of humanities students state that they have at least one 
sponsor, whereas 66% of both natural science and social science students do so (χ² = 
13.57; p<.001; γ = -.34). Sells (1975, p. 12) reports similar differences by field in 
American schools. Fifty-four percent of humanities students report sponsorship as 
compared to 65% of those in the physical sciences and 71% of those in the social sciences. These 
findings demonstrate the more pronounced sponsorship system in the natural science 
disciplines. The sponsorship issue is a very complex one, and much more work needs 
to be done before we can accurately depict its workings in the academic environment, 
particularly as it relates to women and men. 

Before leaving this section on interaction with professors, it should be noted that 
for both the American and Canadian samples, interaction with professors and 
participation in the sponsorship system are positively related to development of 
professional identification. 


Peer Group Interaction 


Both the American and Canadian data suggest that peer group interaction is more 
important for women than for men. For the Canadian sample, although men and 
women engage in a similar number of discussions with peers concerning their research 
interests (X̄ is 5.7 for men and 5.3 for women, in a given two-month period), women 
have a greater tendency than men to describe the discussions with peers concerning 
their research as helpful or extremely helpful. Seventy percent of the women as 
compared to only 52% of the men in the Canadian sample make this claim (χ² = 
10.55; p <.01; γ = .29). These results are similar to those of Kjerulff and Blood (1973, 
p. 629) who found that female graduate students in an American university tend to 
perceive their discussions with peers as more helpful than do men. 


Feldman (1973) found that in American schools, women tend to interact less on 
an informal basis with their peers than do men. Such informal contacts with peers 
should be important for professional socialization by creating and solidifying 
friendships which provide emotional support during graduate years. In the Canadian 
sample, these sex differences are not evident. In fact, graduate students in general do 
not socialize (i.e., spend leisure time) a great deal with their peers. Fewer than one-third 
of the respondents (27% men; 32% women) state that about one-half or more of the 
people they see socially are also fellow graduate students in their department. A male 
philosophy student sums up his relationship with peers by describing them as 
“nonexistent.” “Most of the people I simply choose not to associate with, because we 
share really little in common. ... There are other things I'd rather do with my free 
time than socialize with the people from around here.” 

Another interesting difference in American and Canadian graduate schools is the 
importance of peer group interaction for professional socialization. Feldman (1974) 
found that graduate students who meet informally with their peers are more likely to 
plan university or college teaching careers than those who do not. In the Canadian 
study, surprisingly few of the indicators of interaction with fellow graduate students 
are related to professional identification (cf. Hitchman, 1976). 


Discussion 


The results of this analysis suggest both similarities and differences in terms of 
the condition of women and men in American and Canadian universities. Our data on 
equality of access show similar attrition rates of women in both countries. However, 
American women appear to enjoy greater equality of access in terms of both degrees 
granted, and faculty positions held. Considering these national statistics as a whole, we 
are led to question the existence of equality of access for women in both American 
and Canadian higher education. 

In terms of interaction patterns, again, both similarities and differences are found. 
Women in American and Canadian graduate schools interact less with professors, 
perceive their relationships as less relaxed and friendly, and enjoy less collegiality with 
their professors than do men. However, the lack of sponsorship found in American 
schools does not appear in the Canadian sample. 

What explanations can be offered for the sex differences found here in terms of 
faculty-student interaction? Are women the objects of discrimination (i.e., do faculty 
prefer to interact with male rather than female students?), or do female students act 
in such a way as to discourage faculty-student contact? If one subscribes to the 
discrimination hypothesis then the gender differences may be attributed to the ways 
in which professors perceive women students, i.e., as less committed, less serious, 
and/or less suited to academic pursuits. Hence, they are not perceived as future 
colleagues, and interaction with them on a formal or informal basis is less necessary 
than it is for male students. If one looks to the women students for an explanation, 
one could point to a model of female socialization, which tends to produce more 
passive, person-oriented individuals, more sensitive to criticism, and in greater need of 
support than their male counterparts. Hence, women students may not seek out 
faculty contact as aggressively as men, and being more sensitive to criticism, will tend 
to evaluate relationships with professors in a less positive fashion. It seems 
reasonable to suggest that a combination of factors is at work here, in a 
“catch-22” situation. Lack of interaction with faculty militates against the formation 
of a professional self-concept, while lack of a professional image and the concomitant 
behaviour is unlikely to evoke greater faculty-student contact. 

The American literature (Caplow & McGee, 1958; Epstein, 1971, 1973; Hughes, 
1973) suggests that the female graduate student may be excluded from the 
sponsorship system. Given the sex-status distribution of graduate departments, where 
most potential sponsors tend to be male, women graduate students must seek out 
sponsors of the opposite sex. Because of cultural prescriptions regarding cross-sex 
behaviour, male professors may be hesitant to engage in a close working relationship 
of the sponsor-protégé variety with a woman. Additionally, so the argument goes, a 
male professor may have difficulty perceiving a female graduate student as his 
academic successor, fearing that she may one day desert the profession and retreat 
into the home (Hughes, 1973). The lack of sex differences found in terms of 
sponsorship in Canadian schools could reflect the very broad definition we have given 
it. As Acker (1974, p. 18) notes, we do not know from this question whether the 
sponsor takes the student seriously or will give him or her a reference for a job. 
Furthermore, given the sex differences in sponsorship noted by Sells (1975), and the 
differences found in the egalitarian nature of faculty-student relationships enjoyed by 
women and men, caution must be used in drawing definitive conclusions from the 
Canadian data. It does appear, however, that women are not excluded from this 
system, or at least do not perceive themselves to be excluded. The difference between 
Sells’ data and the findings reported in the Canadian study could reflect a change in 
attitudes towards women over time. Perhaps when Sells’ respondents were graduate 
students, women were considered less serious students than they are at the present 
time, and hence had difficulty finding sponsorship. Moreover, the differences found 
between the American and Canadian schools could reflect the nature of the samples. 
The Canadian data are limited to doctoral candidates, whereas the American data 
include master’s candidates as well. Perhaps it is easier for women to obtain 
sponsorship at the doctoral level, after they have “proven” themselves as successful 
master’s students. 


Sex-status homophily suggests another possible reason for the greater sponsorship 
of women students than that found by Sells. Since the numbers of Canadian female 
faculty are growing, albeit slowly, and since some sex-status homophily is found in 
terms of sponsorship, it could be that these women faculty members are acting as 
sponsors for women students, hence augmenting the proportion of women with 
sponsorship. The implications of the sex-status homophilic sponsorship relation in 
graduate education have been elaborated elsewhere (cf. Symons, 1978b). 


In terms of interaction with peers, both American and Canadian women view 
these relationships as more important than do men. However, differences in informal 
interaction patterns between the sexes found in American studies are not replicated in 
the Canadian one. These latter differences could perhaps be accounted for by the 
nature of the universities sampled. The Canadian schools under study are situated in a 
large urban setting, and the students live in various parts of the city. In such 
commuter schools, students may have more ties with the community, and hence less 
time to spend on an informal basis with their peers. 


Weak relationships were found between peer group interaction and professional 
socialization in Canadian graduate schools. Perhaps the more elitist nature of graduate 
education in Canada can help explain these differences. Professional socialization takes 
place through interaction with significant others. Since these individuals are 
characterized as powerful, they are accorded the requisite attention and deference. 
Given a greater degree of elitism in Canadian schools, perhaps Canadian students view 
their peers as less powerful than do their American counterparts, in terms of assisting 
them in managing a doctoral degree. Hence, because of less recognition of the 
potential power of peers in Canadian schools, they are not taken into account to an 
extent that would affect professional identification, as is the case with American 
students. 


The similarities in the professional socialization process found in American and 
Canadian graduate schools in terms of sex differences are not surprising, since 
Canadian graduate schools in the 1970’s are modeled in large part on the American 
system, and the large percentage of American professors, particularly in some 
disciplines, provides continuity in style as well as ties with American schools that 
could well account for the similar socialization experiences. Furthermore, the condition 
of women in Canadian graduate schools is similar to that of their American 
counterparts. This situation is attributable to the same cause in both countries, 
namely the masculine nature of the university environment. 


This analysis of professional socialization in American and Canadian graduate 
schools points to significant differences in experiences between women and men. If, as 
other studies and symbolic interaction theory suggest, interaction with significant 
others is important for socialization, women in Canadian as well as American graduate 
schools have less access to this avenue for professional development than do their 
male peers. All of the studies reported in this paper point to differential experiences 
of graduate school by women and men. One should not assume that because both 
males and females ostensibly pass through the same “shared ordeal” (Lortie, 1959) in 
the same institutional environment, they necessarily experience the professional 
socialization process in the same way. It would appear, then, that not only do women 
not enjoy equality of access to higher education in both the United States and 
Canada, but neither do they enjoy equality of treatment within these institutions. 


In order to ensure equality of treatment for women and men in graduate school, 
both students and faculty must take the initiative. Faculty should examine their 
attitudes and behaviour with respect to women students, and if they are according 
differential treatment, they should assess if there are legitimate reasons for such 
discrimination. If not, attempts must be made to ensure that women students have the 
same access to professors, the same quality of contact, and the same serious 
consideration as men. 

Women students, for their part, also have a role to play by initiating contact with 
faculty, and attempting to present a professional self-image. Support systems can be 
set up with other women students, where problems are discussed, and strategies 
planned. Women students would be wise to avoid too much reliance on faculty in 
times of stress, rather seeking support from other women during these low periods, for 
as Baird (1969) points out, when students experience stress, the faculty is often of 
little help. Over-reliance on professors at such times only reinforces stereotyped 
notions of women. 


Conclusion 


This research is both relevant and timely, addressing as it does an important 
social issue, not only for the university itself, but also for the larger societies in 
question. The need for more women academics has been apparent for a number of 
years in the United States and Canada. These highly trained women not only develop 
their own skills and potential, as well as add to the productivity of a country, but they 
also serve an important function as role models¹⁰ (Adler, 1976; Ambert, 1976; Bernard, 
1964; Rossi, 1970) for increasing numbers of female students who pass through 
universities. Hence, studying the condition of women graduate students has relevance 
not only for the academic profession, but also more far-reaching implications for the 
nature of the society of the future, and the sexual division of labour in North America. 


Research of this nature can help us gain insights into opportunities for women in 
the academic profession, and by extension, for young women who enter the university 
to prepare for a variety of occupations. If women in Canada and the United States are 
to develop their skills and realize their potential, equality of opportunity, both in 
terms of access and treatment, must be accorded them in the institutions of higher 
learning. 


Notes 
1. For a comparison of American and Canadian universities see Goldenberg (1974). For a history of 
women in higher education in the United States, see Graham (1978), and in Canada, Hitchman (1976, 
pp. 36-41). 


2. Assessing the degree of equality of treatment may be done in a number of ways. For example, 
university structures, which incorporate such things as degree requirements, may differentially affect 
men and women students. A period of residency, which is usually mandatory for most master’s and 
doctoral programmes, may deter women students who can only study part-time due to financial 
and/or family constraints. Lack of adequate support services such as day care may also affect men 
and women differently, since, in spite of alleged changes in the sexual division of labour in Canadian 
and American societies, women still assume the major responsibility for child care. Another way of 
assessing equality of treatment in higher education is to focus on the interactional setting of the 
graduate school, and examine the processes at work. It is the latter approach that is taken in this 
paper. 


3. Occupations are sex-typed as masculine or feminine when “a very large majority of those in them are 
of one sex and when there is an associated normative expectation that this is as it should be” 
(Merton, quoted in Epstein, 1971, p. 68). 


4. One of the few exceptions is a study done by J. J. Williams in 1950 involving women in medicine. 


5. A more detailed analysis of other aspects, with particular reference to Canadian schools, is given in 
Hitchman (1976). 


6. For a discussion of this change in the American context see Graham (1978). 


7. The questions read as follows: 
Please state your religious preference: Protestant; Roman Catholic; Jewish; None; Other. Specify. 
(Hitchman, 1976, p. 338). 
What is your present religion? Protestant; Roman Catholic; Jewish; Other; None; No answer. 
(Feldman, 1974, p. 204) 
In Acker’s (1974) study, the exact question is not reported, but respondents were asked to state their 
own “current religion.” 


8. It should be noted here that Sells’ definition of sponsorship differs somewhat from ours. Her 
question reads “As a graduate student, was there a faculty member who acted as your ‘sponsor’ when 
you were looking for your first job?” It was addressed to faculty respondents who participated in the 
Carnegie study on higher education. 


9. The term status homophily was first used by Lazarsfeld and Merton (1954, p. 24) to describe the 
friendship process. This concept was extended by Kandel (1966) and Marx and Spray (1971) to 
include the therapeutic process. The notion of status homophily may be extended to the case of the 
faculty-student relationship in graduate school. Homophily is used here to refer to similarities 
between certain characteristics of faculty and graduate students that may affect the socialization 
process. It is the influence of sex-status homophilic faculty-student relationships that is considered. 
For an analysis of the significance of sex-status homophily for professional socialization see Symons 
(1978b). 


10. The relationship between faculty and students is not a simple one. For a discussion of this issue with 
respect to graduate students, which considers both the sex of the faculty as well as that of the 
student, see Symons (1978b). 


A revised and expanded version of a paper presented at the Canadian Sociology and Anthropology 
Association Annual Meetings, London, Ontario, May, 1978. 


Teachers and Decisions 


In this study, an attempt is made to examine the types of decisions with which 
teachers wish to be involved. Questionnaires asking respondents to select a type of 
decisional approach were sent to 500 elementary and secondary teachers in six 
school districts within Ontario, Canada. Responses were received from 49.4% of 
those surveyed. Percentages and chi square tests were employed as analytical 
instruments. The findings indicated that: (a) teachers do not wish to make the 
final decision in matters dealing with discipline, economics, or home-school 
problems; (b) in the curriculum area, they wish only an equal voice with the 
principal; (c) in federation matters, they desire sole authority on decisional 
questions; and (d) they want greater consultative involvement in the 
decision-making process. 


Ideally, Louis J. Rubin (1970) argues, “a school should be a synergy in which the 
forces of its several components unite in the accomplishment of the organizational 
goals” (p. 52). Two of the components to which he refers are obviously principals and 
teachers, and one of the ways in which these two factors unite to accomplish the 
organizational goals is through the decision-making function. That process will be 
examined in this paper. 

There is no dearth of literature dealing with studies of educational 
decision-making, and most such studies seem to agree that “there is some kind of 
sequential order...in the process, but the agreement as to what elements are to be 
included is much less pronounced” (Jenson & Clark, 1964, p. 52). This tends to suggest 
that there is still some controversy surrounding the relative involvement of the parties 
involved with the decision-making process of the school environment. 


The principal is presumed to be the designated leader of the contemporary school. 
One need only look at the job description of such a role to realize that the principles 
of responsibility and accountability have indeed been augmented in recent 
times. If, in fact, principals are under pressure to provide “for the total education of 
students, supervision of all personnel and for all phases of pupil personnel 
administration within the building” (Cullers, 1976, p. 179), it should not be overly 
surprising that they would demand the right to be an integral part of the 
decision-making process. Here, they might argue, the old administrative truism seems 
very applicable: “responsibility without authority leads to ineffectiveness” (Nolte, 1974, 
p. 29), and if there is one thing that the political and social realities of the day will 
not tolerate, it is an ineffective and inefficient principal. 


The desire of teachers to be included in the decision-making process is enunciated 
quite well by Rubin (1970) who states that 


whether they have the right to or not, and whether they act from necessity or desire, 
teachers are asking for the power to affect the educational policies of the schools in 
which they work. Teachers are seeking, in short, to make the privileges inherent in 
the traditional idea of democratic leadership more of a reality. (p. 54) 


While one might argue that there are a multitude of factors responsible for this 
growing desire on the part of the instructional staff to become more involved in the 
decisional process, two predominant arguments stand out. One is that as the teacher 
unions and federations have increased their power bases, the strength has siphoned 
down into the ranks of the individual schools, and hence has affected the process of 
decision-making at that level. A second argument often advocated for increased 
participation in this area has been noted repeatedly in the literature. It deals with the 
basic assumption that the morale of workers increases when they feel that they are a 
significant part of the decisional process, and only when such authentic involvement 
occurs will the educational environment operate at its optimum level. 


Hence, the traditional role of the principal, however benevolent the latter may be, 
as the sole decision-maker within the school setting is undergoing a re-examination. 
This study is an attempt to contribute to such a movement. It will try to ascertain if, 
in fact, teachers do wish to be involved in all decisions made at the school level, or if 
such involvement is really only sought in certain specific areas. Prior to outlining the 
findings, however, a description of the methodology employed to collect and analyze 
the data will be offered. 


Methodology 


Five hundred questionnaires were randomly distributed to teachers employed by
six county school boards in Ontario, Canada. Each questionnaire listed twenty 
problem statements which could be categorized into five specific areas: (1) curriculum, 
(2) economic matters, (3) federation, (4) discipline, and (5) parental problems. There 
were four problem statements for each of the areas. These decisional situations had 
been identified through the following process. 


A group of twenty-five teachers were each asked to list, at random, decisional
matters that could arise in an educational setting. The researcher, along with a 
colleague, collated the responses and categorized each one of them under one of the 
five specific areas mentioned previously. The teachers were then asked individually to 
rank order, on the basis of frequency of occurrence, the decisional situations placed 
under each major heading. The results were tabulated and the four decisional 
situations arising the most frequently in each category were determined. The twenty 
situations were then listed in random order on the questionnaire. Finally, a pretest of 
the instrument was conducted on a group of thirty-six teachers from the local area to 
verify the clarity and validity of the decisional situations identified. 


The recipients of the questionnaire were requested to indicate their feelings about 
involvement in the decision-making process by selecting one of the following responses 
for each problem presented: 


a. A decision by the principal alone,


b. A decision by the principal after a consultation with each member of the
staff involved in implementing the decision,


c. A combined decision by the principals and teachers together, or 


d. A decision by the teachers alone. 


Percentages were then calculated on the basis of the number of responses selected for 
each problem area. 


In addition, the questionnaire also requested demographic information concerning 
the sex and age of the respondents, their years of teaching experience, and the grade 
level in which they carry out the majority of their teaching duties. Chi square tests 
were conducted on each of these variables to ascertain their relationship to the 
decisional selection of the respondents for each problem area. 
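
The chi square test of independence described above can be sketched as follows. The counts are invented purely for illustration (the study's cell frequencies are not reported here), and the function name is hypothetical; the critical value 7.815 is the tabled chi-square value for three degrees of freedom at the .05 level.

```python
# Hypothetical counts (NOT from the study): rows are respondent groups,
# columns are response options A, B, C, D for a single problem area.
observed = [
    [30, 25, 35, 8],   # e.g. male respondents (made-up numbers)
    [25, 35, 80, 9],   # e.g. female respondents (made-up numbers)
]


def chi_square_independence(table):
    """Return (chi-square statistic, degrees of freedom) for a contingency table."""
    row_totals = [sum(row) for row in table]
    col_totals = [sum(col) for col in zip(*table)]
    grand_total = sum(row_totals)
    statistic = 0.0
    for i, row in enumerate(table):
        for j, count in enumerate(row):
            # Expected count if the row variable and the response were unrelated.
            expected = row_totals[i] * col_totals[j] / grand_total
            statistic += (count - expected) ** 2 / expected
    df = (len(table) - 1) * (len(col_totals) - 1)
    return statistic, df


statistic, df = chi_square_independence(observed)
CRITICAL_05_DF3 = 7.815  # tabled chi-square critical value, alpha = .05, df = 3
print(f"chi-square = {statistic:.2f}, df = {df}")
if statistic > CRITICAL_05_DF3:
    print("response choice is related to the grouping variable at the .05 level")
else:
    print("no relationship detected at the .05 level")
```

A statistic above the critical value would indicate that the distribution of responses differs between the groups for that problem area.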


The 247 teachers (49.4% of those surveyed) who returned the questionnaires 
closely resembled the population of the survey on the following variables: type and 
size of school, sex, age, grade level taught, teaching experience and professional
preparation. 


Presentation of Data 


There seems to be little doubt that teachers view themselves as an integral part of 
the educational decision-making process. However, the data of the study suggest that 
they view themselves more as consultants to the principal than as sole decision-makers 
in most educational matters. In fact, as Table 1 indicates, of the five categories 
examined within this study, only one area, that dealing with federation matters, 
emerges as an area in which teachers feel they alone should make any necessary 
decisions. In each of the other four categories, the largest single percentage of 
respondents opted for a combined principal-teacher decision-making process. However, 
it is important to note that in the areas of economics, discipline and parent-school 
matters, the combined percentages of categories A and B, both essentially giving the 
principal the primary decision-making function, are greater than the percentage of 
respondees in category C. The only exception to this concerns decisions dealing with 
curriculum matters. In that area, the shared principal-teacher decision-making concept 
garnered greater support than the other three areas combined. It is also essential to 
note that although a large number of individuals elected to give the principals the 
ultimate decision-making authority in three of the five categories under consideration, 
a good portion of them felt that they should be consulted for an opinion prior to that 
decision. Hence, one might conclude that teachers feel that:


a. they should be the sole decision makers in federation matters;


b. in curriculum matters, decisions should be made by teachers and principals
together;


c. the ultimate decision for economic, discipline and home-school matters
should be left in the hands of the principal. However, a large number of
teachers feel that such decisions should only be made after prior
consultation with the teaching staff involved.


TABLE 1
DECISIONAL APPROACHES TO VARIOUS PROBLEM AREAS


Type of Decision-Making Process By Percentage


Problem Area        A             B             C             D
Curriculum          8.9           [illegible]   61.7          [illegible]
Economic Matters    29.3          21.0          43.9          5.8
Federation          [illegible]   [illegible]   [illegible]   35.2
Discipline          23.6          26.1          42.2          8.1
Parental Matters    28.7          [illegible]   39.5          4.6

A = Decision By Principal Alone

B = Decision By Principal After Consultation With Teachers

C = Decision By Principal And Teachers Combined

D = Decision By Teachers Alone
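
The comparison drawn in the text, that categories A and B together outweigh category C, can be checked for the two rows of Table 1 whose four percentages are all legible. This is only an arithmetic sketch; the variable and function names are illustrative, and the remaining rows are omitted because some of their cells cannot be read.

```python
# Percentages for options A, B, C, D, taken from the fully legible rows of Table 1.
table_1 = {
    "Economic Matters": (29.3, 21.0, 43.9, 5.8),
    "Discipline": (23.6, 26.1, 42.2, 8.1),
}


def principal_led_share(row):
    """Combined percentage for options A and B (principal holds the final say)."""
    a, b, _, _ = row
    return a + b


for area, row in table_1.items():
    combined_ab = principal_led_share(row)
    shared_c = row[2]
    print(f"{area}: A+B = {combined_ab:.1f}, C = {shared_c:.1f}, "
          f"row total = {sum(row):.1f}")
```

In both rows the combined A and B percentage exceeds the percentage for C, which is the pattern the text describes for these areas.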


Several demographic variables were also isolated within the study. These were sex, 
age, experience, and grade. Each of the problem areas was examined in light of these
factors. Looking first at the differences created by the male-female variable, one finds 
that women expressed a more democratic attitude toward decision-making than the 
men. This appears to hold true for each of the problem areas identified here, as a 
higher percentage of females selected the combined decisional approach than did the 
males. Only in the areas of economics, federation and discipline do the age and
experience variables come into play. In each of these categories, it would appear that
the older the teacher, the less willing he/she is to leave the decision-making activity 
solely in the principal’s hands. Rather, a desire was expressed to place decisions of 
these types under combined principal-teacher auspices. The grade level variable 
appears to affect the responses offered by teachers in the curriculum, discipline and 
federation areas. In the first two categories, the data tend to indicate that secondary 
school teachers are more apt to allow the principal to be the sole decision-maker than 
are their elementary counterparts. On the other hand, when one examines decisions 
concerning federation matters the elementary teachers are the ones more willing to 
share decision-making rights. The secondary teachers responded that decisions about
federation matters should fall primarily into the teachers’ domain. 


Analysis of Data 


Contrary to suggestions by Chase (1953) and Cullers (1976) that teachers expect 
democratic administration from their principals, the results of this study indicate 
rather that principals are expected to make the ultimate decisions about school 
economic matters, discipline, and home-school problems on their own. One possible 
reason for this position in the latter two areas might be a realization and acquiescence 
on the part of teachers to the existent community role expectations for the principal. 
As Woodring (1978) and Thompson (1977) point out, very often the parents may 
expect the principal to employ a more authoritarian style of leadership with both staff 
and students. The rationale for the data dealing with decisions about school economic
matters might well be that provided by Hall (1972). He argues that, “where decisions 
are routine and information standardized, or if the tasks are well structured or 
familiar, a more authoritarian style is appropriate” (p. 250). 
Hence, one might argue that school economic matters are structured, to a great
extent, by local budget constraints and guidelines, and in most cases, are made in a 
school situation on the basis of fairly standardized information. Moreover, the 
principal probably could be said to have the best overall perspective on the situation, 
and hence be the only one with the information base necessary to make the most 
appropriate decision. Thus, the teachers acquiesce to the principal in these matters. 


Teacher demand for equal involvement in curriculum decisions is not overly 
surprising. A study by Inkpen, Ponder, and Crocker (1974) indicated that teachers do 
indeed desire increased opportunity for participation in these matters. The data of 
this study substantiate their conclusions. Since teachers are both the individuals in
constant contact with the needs and abilities existent within the classroom, and the 
actual “delivery system” of the curriculum process, it is quite apparent that they do 
have a sizeable investment in ascertaining the appropriateness of the curriculum for 
their classroom. On the other hand, a realization that someone must coordinate the 
overall program within the school might account for teachers’ feelings that principals 
should also have a voice in decisions of this nature. 


The final problem area examined within this paper, that dealing with federation 
matters, was one in which teachers felt they alone should be the decision-makers. This 
is not too surprising when one considers the “two hat syndrome” forced upon the 
principal (Awender, 1978, p. 563). The latter is expected to be part teacher and part 
administrator, and it is this second role that casts him in a management perspective to 
many teachers. Hence, there may be a natural reluctance on the part of the 
instructional staff to share decisions about matters, generally regarded as the domain 
of the rank-and-file, with quasi-management personnel. 


Turning to the demographic variables examined in the study, it was discovered 
that females have a more democratic orientation toward educational decision-making 
than do males. These data coincide with an earlier study by Goldberg in 1967 which, 
in turn, is cited in a subsequent study by Robinson (1968). A closer examination of 
this variable does present another interesting trend. In all categories except the 
home-school area, a greater percentage of males than females expressed a desire to
relinquish the decisional process to the principals. One might speculate that this 
results from the possibility that generally, a higher percentage of men than women 
within the system traditionally have been “one income and upward bound people”, and 
hence have found involvement in the decision-making process more of a risk-taking 
venture than females, and thus have been more willing to leave decisions in the hands 
of the principal. Both the age and experience variables demonstrated a positive 
relationship with involvement in the decision-making process. As might be expected, 
the older, more experienced teachers, perhaps feeling that they had accumulated 
sufficient background knowledge and experience to provide meaningful input into 
decisions concerning the problem areas examined in the study, laid claim to greater 
involvement in the decisional process. One might also speculate that additional 
security emanated from age and experience, and hence they perceived additional 
involvement in ultimate decisions as less of a risk-taking procedure than those 
teachers at the other end of the continuum. The study also points out that secondary 
school teachers are more willing than those individuals in the elementary sector to 
permit principals the right to make decisions in the curriculum and discipline area. 
Such a willingness might result from the fact that these two areas have traditionally 
been out of the average secondary school teacher’s domain anyway. In the past, 
discipline has very often been assigned to a vice-principal, and curriculum matters to 
department heads. Thus, these individuals have not, as a general rule, made final 
decisions of this nature. Conversely, the secondary teachers are much more reluctant 
than their elementary counterparts to give their principals a final or even equal voice
in decisions about federation matters. Such a development is not all that surprising if 
one considers the relative historical actions of the two organizations within Ontario. 


Conclusions 


This study was undertaken to determine how teachers viewed their role in the 
decision-making process. Moreover, an attempt was made to discover how the 
variables of sex, age, grade level, or experience would affect their particular viewpoint. 


It would appear from the findings of the study that teachers are not nearly as 
willing to make the final decision on matters dealing with the areas examined here as 
a great deal of the literature on the subject might suggest. In fact, the only area where 
they appear to desire the ultimate authority for decisions is that dealing with 
federation matters. This does not mean, however, that the teachers wish to relinquish 
involvement in the decision-making process. Rather, the results tend to indicate that
involvement is sought. However, the form of that involvement is not decision-making 
authority, or even in most cases a combined decision by teachers and principals, but 
rather simply consultation with the principal prior to the latter making the final 
decision. 


The study obviously has its limitations. For one thing, it has not ascertained the 
relative importance of each of the problem areas to the teacher. Hence, one cannot 
empirically correlate the variables of importance and involvement. Nor was any
attempt made to determine what the state of teacher involvement in
decision-making actually is. Rather, the study simply approached the question of what
should be. Thus, one cannot draw conclusions about discontent between the “is” and
“ought” factors, as did an earlier study by Knoop and O’Reilly (1977). Additional data
on these areas may be developed at some future date.


It should also be noted here that the study did not focus upon the situational 
nature of leadership and decision-making outlined in the Vroom-Yetton contingency 
model (1973), and later validated by Vroom and Jago (1978). The model suggests that 
the type of decisional process that a leader might employ in a particular situation 
should be based upon the following set of considerations as they relate to that specific 
situation: the leader’s information base, the congruence of the subordinate’s personal 
goals and those of the organization, the degree of structure inherent in the problem, 
the necessity of obtaining subordinate acceptance of the decision, the desire to 
eliminate conflict and differences among subordinates, their perceptions of fairness, 
and finally, the degree of acceptance that is sought about the decision. For the 
purposes of this study, however, it was decided to pursue the question of involvement 
in the decision-making process from the perspective of subordinates’ desires rather 
than from a subordinate’s feasibility viewpoint established in accordance with the 
aforementioned model. 


Hence, the results of this study suggest that teachers do not really appear to see 
themselves from a decisional deprivation (Belasco & Alutto, 1972) viewpoint, for they 
do not appear to want ultimate authority to make decisions in a large number of 
areas. As Ponder (1974) points out, “since involvement in decision making is a time 
consuming process, teachers are likely to have established a hierarchy of decision 
priorities” (p. 11). Rather, from the data examined here, one might conclude that a 
consultative deprivation exists, and it is to this that administrators should address 
themselves. If they do, it would appear that many of the negative feelings generated 
by unilateral decisions might well be eliminated, and hence an atmosphere more 
conducive to a sound and successful educational organization would be created. 
In this study, a behavioural therapy, systematic desensitization using deep muscle
relaxation, was administered to a group of preservice elementary teachers to 
determine its effectiveness in treating mathematics anxiety. Statistically significant 
reductions in mathematics anxiety were obtained as predicted. The effects of the 
therapy on subjects’ enjoyment of mathematics and on its perceived value were 
also investigated. No statistically significant changes in these attitude measures 
were found. Finally, the relationship between mathematics anxiety and other global 
measures of anxiety was analyzed. Generally, the results indicated that 
mathematics anxiety is not an isolable phenomenon, but is related to a more 
globally defined notion of anxiety. 


Since its introduction in 1958, systematic desensitization has been demonstrated to 
be an effective therapeutic technique for the reduction of anxiety with a variety of 
phobias: kleptomania (Mazagas, 1972), oral exams (Bruno & McCullough, 1973), 
mathematics anxiety with university students (Richardson & Suinn, 1973), and 
test-taking (Deffenbacher & Kemper, 1974), to name a few applications of the
procedure. 


In desensitization therapy, a physiological state inhibiting to anxiety is induced in 
the subject, usually deep muscle relaxation using a modification of Jacobson’s 
progressive relaxation techniques developed in 1938. In desensitization therapy, a 
subject, after becoming very relaxed, is presented with a weakly anxiety-arousing 
stimulus. In such instances the relaxation response is stronger than the anxiety 
response. Over repeated exposures for brief periods of time, the association of the 
stimulus event with the anxiety response is gradually weakened and broken, while 
being built up in its stead is the new association of the same stimulus event with the 
relaxation response. Successively stronger stimuli are similarly introduced and treated. 
In a step-by-step fashion, previous associations between stimulus cues and anxiety
responses are broken down and replaced by the new associations. This process has
been called “reciprocal inhibition” by Wolpe (1958). Theoretically, the relaxation 
reciprocally inhibits the concurrent anxiety, thus weakening the anxiety response 
habit. In passing, while research studies have established the effectiveness of 
desensitization as a treatment procedure, they have not supported Wolpe’s original 
account of the mechanisms of fear reduction (Sieber, O’Neil & Tobias, 1977). Other 
therapists have acknowledged the contributions of behavioural therapy in general and 
desensitization in particular, but have argued that behavioural therapy is simply not 
capable of accounting for all manner of behavioural change. In their view, therapists 
must necessarily adopt an eclectic point of view if they wish to understand 
behavioural changes (Lazarus, 1976, 1977). 


Problem 
This study was designed to investigate the viability of applying systematic 
desensitization therapy to alleviate mathematics anxiety (anxiety in the presence of 
mathematics) as exhibited by elementary teachers-in-training. Also, the relationship 
between attitudes and anxiety toward mathematics and the effects of systematic 
desensitization on this relationship were assessed. 


From the literature, it was anticipated that those high anxious students receiving 
treatment would report a reduction in the level of mathematics anxiety to a degree 
beyond that of those who did not receive treatment. To provide support for this 
anticipation, the earlier noted study by Richardson and Suinn could be used, and it 
was also noted that Hyman (1974) had used desensitization to reduce mathematics 
anxiety among a group of high school students. 

The relationship between introversion-extraversion and level of anxiety is unclear. 
There does seem to be some interaction between these variables on certain 
performance measures. Trown (1970) found that extraverts were superior on retention 
tests when their level of anxiety was high but were superior on transfer tests when 
their anxiety was low. It has been found that scores on the Taylor Manifest Anxiety 
Scale (MAS) are frequently a mixture of the introversion and neuroticism factors of 
the Eysenck Personality Inventory (EPI) and do not represent a unifactorial concept 
(Fenz & Epstein, 1965). The seemingly complex relationship between 
introversion-extraversion and anxiety was investigated further in this study. More 
specifically, the notion of anxiety used was that of anxiety towards mathematics. 

Spielberger (1966) has emphasized the necessity of distinguishing between anxiety 
as a stable trait and anxiety as a more transitory state variable. The relevant question 
for this study concerned the status of mathematics anxiety either as a transitory state 
phenomenon or as a manifestation of a more global anxiety trait. The status of 
mathematics anxiety as a state or a trait was investigated in this study. 

The effects of systematic desensitization therapy on student attitudes toward 
mathematics are unclear. Addleman (1972) found that systematic desensitization had no
significant effect on measured attitudes. Nonetheless, conceptually it would seem that 
anxiety and attitudes toward mathematics ought to be dynamically interrelated. It 
seems likely that attitude toward mathematics measurement has suffered in the past 
from a lack of conceptual clarity. The use of Aiken’s (1974) two scales of attitude 
toward mathematics in this study was an attempt to shed some light on these complex
phenomena.


Sample
The subjects in the treatment group were drawn from an elementary education
mathematics teaching methods class. They were drawn on the basis of scores on the
Mathematics Anxiety Rating Scale (MARS) (Suinn, 1972), given to the class at the
beginning of the school term. It was to those students scoring highest on this scale 
that the program was offered. They were defined as the “high anxious group” within 
the class population. The remainder of the class was termed “mid/low anxious group.” 


Because of attrition the total population of the class changed over time. This 
meant that the treatment groups changed as well. These changes are defined in Table 1.


Participation was on a volunteer basis. Those who did volunteer were randomly 
assigned to one of two treatment groups (A or B). 


TABLE 1


SAMPLE SIZES ON TEST OCCASIONS


                                                  Test Occasions
                                                  I      II     III    IV
Group A (High Anxious, First Treatment Group)     11     11     11     8
Group B (High Anxious, Second Treatment Group)    13     7      7      7
Group C (High Anxious, No Treatment)              27                   26
Group D (Mid/Low Anxious, No Treatment)           90                   60

Intervals between test occasions: I to II, four weeks; II to III, four weeks; III to IV, one week.


Instrumentation


Four test instruments were used in this study: 

Mathematics Anxiety Rating Scale (MARS) (Suinn, Edie, Nicolletti, & Spinelli, 
1972). This is a 98-item inventory of items that would presumably arouse mathematics 
anxiety. Its test-retest reliability on 119 college students was .78 according to Suinn. 
Suinn reports that its construct validity in research settings has been promising. 

Aiken Attitude Scales (Aiken, 1974). This instrument is scored on two scales. 
Aiken reported that the eleven-item “Enjoyment of Mathematics” subscale had a 
coefficient Alpha of .95 based on an administration to 190 college freshmen. The 
ten-item “Value of Mathematics” subscale had a coefficient Alpha of .85. 

State-Trait Anxiety Inventory (STAI) (Spielberger, Gorsuch & Lushene, 1970).
This instrument is scored on two subscales. One of the scales measures the subject’s
present state of anxiety (A-State). The other scale measures anxiety as a general trait
(A-Trait). Spielberger reports that the test-retest reliability of the STAI (A-Trait) is
relatively high but stability coefficients for STAI (A-State) tend to be relatively low, 
as would be expected for a measure that is designed to be influenced by situational 
factors. Both scales have high internal-consistency coefficients. 

Eysenck Personality Inventory (EPI) (Eysenck & Eysenck, 1968). This 
instrument is scored on two subscales — Neuroticism and Extraversion. Test-retest 
reliabilities over a long period of time ranged between .80 and .94 for the separate 
scales as reported by Eysenck. 


Procedure 


Both treatment groups (A and B) received the systematic desensitization therapy 
based on an anxiety hierarchy constructed from their responses on the MARS. The 
anxiety hierarchy consisted of fifteen graded items from the MARS and three graded 
items relating to the teaching of elementary school mathematics. Groups C and D
received no treatment but were included for comparison purposes. 

The entire experimental procedure was completed within a period of nine weeks. 
These nine weeks were punctuated by test administrations on four different occasions. 
On test occasion I the four groups were given the four tests defined above. Between 
test occasions I and II, Group A received the three-week therapy treatment. On test 
occasion II, Groups A and B were given the MARS and the Aiken Attitude Scales. 
Between test occasions II and III, Group B received the therapy treatment. On test 
occasion III, Groups A and B again were given the MARS and the Aiken Attitude 
Scales. Finally, these two tests were administered to all groups on test occasion IV. 
The procedure is summarized in Table 2. 


TABLE 2
PROCEDURE SCHEDULE


                          Test Occasion
           I               II               III              IV
              (four weeks)     (four weeks)     (one week)
Group A    Pre-Treatment   Post-Treatment   Follow-up        Follow-up
           ------- Treatment -------
Group B    Initial         Pre-Treatment    Post-Treatment   Follow-up
                           ------- Treatment -------
Group C    Initial         ------- No Treatment -------      Follow-up
Group D    Initial         ------- No Treatment -------      Follow-up


Results


One of the purposes of the study was to investigate the relationship between 
anxiety toward mathematics and anxiety as defined more generally. Another purpose 
was to explore the relationship between mathematics anxiety and anxiety measured as 
a state variable and as a trait variable. The results of the test occasion I 
administration of the MARS, EPI and the STAI are the relevant sources of 
information for these questions. The means and standard deviations for the MARS, 
EPI and STAI are given in Table 3. The matrix of correlations for these variables is 
given in Table 4. 


TABLE 3
MARS, EPI, AND STAI MEANS AND
STANDARD DEVIATIONS (N=125)


Instrument              Mean       Standard Deviation
MARS                    197.576    45.433
STAI
  State Scale           46.368     9.873
  Trait Scale           46.896     10.404
EPI
  Neuroticism Scale     34.872     5.002
  Extraversion Scale    36.088     4.056


TABLE 4
CORRELATIONS AMONG THE MARS, EPI AND STAI


                    MARS     EPI            EPI             STAI      STAI
                             Neuroticism    Extraversion    State     Trait
MARS                1.000    0.425          -0.087          0.412     0.540
EPI Neuroticism              1.000          -0.335          0.445     0.772
EPI Extraversion                            1.000           -0.166    -0.279
STAI State                                                  1.000     0.635
STAI Trait                                                            1.000


Under the null hypothesis that the population correlation was zero, each 
correlation coefficient was tested for statistical significance. The results indicated that 
all correlations were significantly different from zero (p<.01, df = 123) except the 
correlation of MARS and EPI-Extraversion (p=.333, df = 123) and EPI-Extraversion 
with STAI-State (p=.065, df = 123). 
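
The significance test applied to each coefficient can be sketched as below, assuming the usual conversion of a correlation r to a t statistic, t = r·sqrt(df / (1 − r²)), with df = 123. The function names are illustrative, and the two-tailed p-value is obtained by numerically integrating the t density (Simpson's rule) rather than by whatever routine the original analysis used. Because the published coefficients are rounded, the p-values should land near, not exactly on, the reported .333 and .065.

```python
import math


def t_statistic(r, df):
    """Convert a correlation coefficient to a t statistic with df degrees of freedom."""
    return r * math.sqrt(df / (1.0 - r * r))


def t_pdf(x, df):
    """Density of Student's t distribution, computed on the log scale for stability."""
    log_norm = (math.lgamma((df + 1) / 2) - math.lgamma(df / 2)
                - 0.5 * math.log(df * math.pi))
    return math.exp(log_norm - (df + 1) / 2 * math.log1p(x * x / df))


def two_tailed_p(t, df, steps=2000):
    """Two-tailed p-value: 1 minus the area between -|t| and +|t| (Simpson's rule)."""
    upper = abs(t)
    if upper == 0.0:
        return 1.0
    h = upper / steps  # steps must be even for Simpson's rule
    total = t_pdf(0.0, df) + t_pdf(upper, df)
    for i in range(1, steps):
        total += (4 if i % 2 else 2) * t_pdf(i * h, df)
    area_zero_to_t = total * h / 3
    return max(0.0, 1.0 - 2.0 * area_zero_to_t)


DF = 123
# Coefficients as printed in Table 4.
correlations = {
    "MARS with EPI-Extraversion": -0.087,
    "EPI-Extraversion with STAI-State": -0.166,
    "MARS with STAI-Trait": 0.540,
}
for label, r in correlations.items():
    t = t_statistic(r, DF)
    print(f"{label}: r = {r:+.3f}, t = {t:+.2f}, p = {two_tailed_p(t, DF):.3f}")
```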


Using two-way analysis of variance with repeated measures on the MARS (test 
occasions I and II) as one of the factors and the groups (Group A and Group B) as the 
other factor, statistical significance (α = .01) was found in the differences of the
repeated measures and in the interaction of the repeated measures and the group 
factor. For Group A the test occasions I and II mean scores were 253.7 and 170.7 
respectively. The standard deviations were 40.12 and 26.41 respectively. Corresponding 
mean scores for Group B were 242.3 and 223.3 with standard deviations of 22.6 and 
57.6 respectively. Some of this information is included in Table 5 along with
information obtained from the other test occasions and from Groups C and D. 
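
The interaction reported above can be made concrete with simple arithmetic on the four means quoted in the text: the drop in MARS score from occasion I to II for each group, and the difference between those drops. This is only a descriptive sketch with illustrative names; it does not reproduce the repeated-measures analysis of variance, which requires the individual scores.

```python
# Mean MARS scores on test occasions I and II, as quoted in the text.
mars_means = {
    "Group A": {"I": 253.7, "II": 170.7},  # treated between occasions I and II
    "Group B": {"I": 242.3, "II": 223.3},  # not yet treated at occasion II
}


def mean_change(group):
    """Reduction in mean MARS score from occasion I to occasion II."""
    return mars_means[group]["I"] - mars_means[group]["II"]


change_a = mean_change("Group A")
change_b = mean_change("Group B")
difference_in_change = change_a - change_b

print(f"Group A reduction: {change_a:.1f}")
print(f"Group B reduction: {change_b:.1f}")
print(f"Difference between reductions: {difference_in_change:.1f}")
```

The treated group's mean fell by roughly four times as much as the untreated group's over the same interval, which is the pattern a significant occasion-by-group interaction would reflect.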


TABLE 5
MEAN SCORES ON THE MARS FOR ALL
GROUPS ON ALL THE TEST OCCASIONS


                       Test Occasion
           I             II            III           IV
Group A    253.7         170.7         163.0         158.1
           N=11                                      N=8
Group B    242.3         223.3         [illegible]   188.0
           N=7
Group C    [illegible]                               191.0
           N=26
Group D    [illegible]                               162.0
           N=60


Note: Number of subjects in Group A changes from 11 to 8 on test occasion IV as 3 subjects were 
unavailable at that time for assessment. 
In analyzing the results for the Aiken Attitude Scales in the same way, no
statistically significant differences were found. 


It is apparent in Table 5 that all four groups had reduced mean scores when the 
data from test occasions I and IV were compared. A two-way analysis of variance with 
the repeated measures (test occasions I and IV) as one of the factors and the groups 
(Groups A, B, C, and D) as the other factor revealed a statistically significant 
reduction in mean scores across all of the groups (α = .01), a statistically significant
group effect (α = .01), and a statistically significant interaction between the group
factor and the repeated measures (α = .01). Further analysis using Scheffé’s multiple
comparison procedure revealed statistically significant differences in mean scores from 
test occasion I to IV for Groups A, B, and C but not for Group D. Group C received 
no treatment but the members were enrolled in an elementary mathematics methods 
class at the time. 


The remaining results were analyzed by considering each of the groups, A and B, 
separately. The purpose was to analyze the group scores on MARS and the Aiken 
Attitude Scales over time, remembering that the scales were given to each group on 
four separate occasions. 


For Group A, a multivariate analysis of the repeated MARS measures indicated 
that the null hypothesis of equal means for all test occasions was not tenable 
(p<.008). Beyond that, the contrast between pre- and post-therapy administrations 
(Test Occasions I and II) was statistically significant (p<.01). With α = .05, all other
adjacent pair contrasts were not statistically significant. Conceptually, these contrasts 
were the only ones considered important. A similar analysis of Group A’s Aiken 
Attitude Scales results revealed no statistically significant differences. 

For Group B, a multivariate analysis of the repeated MARS measures indicated
that the null hypothesis of equal mean scores for all test occasions should be rejected
(p<.04). However, none of the adjacent pair contrasts was statistically significant
(α = .05). Beyond that, it made sense, conceptually, to average the scores from Test
Occasions I and II and compare that to the average of scores from Test Occasions III 
and IV. (Group B received the treatment between Occasions II and III). This contrast 
was near statistical significance (p <.052). Analysis of the Aiken Attitude Scales results 
for Group B revealed no statistically significant differences. 


Discussion 
Most importantly, systematic desensitization as a therapeutic procedure to 
alleviate mathematics anxiety was successfully demonstrated. This parallels the 
positive results obtained by Hyman with high school students. 

It should be noted that the apparent reduction in anxiety for Group A might have 
been attributable to the fact that the high anxious subjects in that group might have 
been relieved by simply having their anxieties recognized and having therapy available 
to them. This rival hypothesis is less convincing when the pattern of scores for the 
comparable Group B is analyzed. The significant interaction of the MARS repeated 
measures factor and the group factor in the two-way analysis of variance would imply 
that the therapy had a significant impact over and above that due to the subjects’ 
expectations. 

Because of the design of the study, it was not possible to determine which specific 
mechanisms of the therapy (directions, ways of seeing improvement, etc.) might have 
had a positive effect on subjects. These questions are important but were not a focus 
of this study. A more serious design problem was the possible confounding of the 
therapist with the therapy. There was no real way of separating the effect of the 
therapy from that of the therapist in this study. That would appear to be a design 
concern of special importance for any future research work in this area. 


Demonstrating the feasibility of using desensitization therapy to alleviate 
mathematics anxiety was the primary focus of this study, but beyond that, an attempt 
was made to determine the effect of the therapy on attitudes toward mathematics. 
This attempt was not very enlightening since the effect of the therapy on attitudes 
appeared to be minimal. This is similar to Addleman’s results (1972). Apparently, 
attitude toward mathematics is not directly related to mathematics anxiety. It seems 
that more work of the kind that Aiken has done in identifying facets of attitude is 
required before attitude can be meaningfully related to other personality variables. 


Finally, data were gathered to answer questions about the relationship between 
mathematics anxiety and other forms of anxiety. With a class of 125 students the 
correlations of the MARS with the A-State and A-Trait subscales of the STAI and the 
neuroticism and extraversion subscales of the EPS were 0.412, 0.540, 0.425, and -0.087 
respectively. All were statistically significant (α = .01) except the last one. From these 
results it would appear that mathematics anxiety is not an isolable phenomenon. It 
seems to be related to a more globally defined notion of anxiety. 
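
The significance pattern reported above can be checked with a short calculation. The sketch below is only an illustration: the text does not say which test was applied, so the Fisher z approximation used here is an assumption, as are the variable and function names. The correlations, the class size of 125, and the .01 level are taken from the text.

```python
import math
from statistics import NormalDist

N = 125        # class size reported in the text
ALPHA = 0.01   # significance level reported in the text

# Correlations of the MARS with each subscale, as reported in the text.
correlations = {
    "STAI A-State": 0.412,
    "STAI A-Trait": 0.540,
    "EPS neuroticism": 0.425,
    "EPS extraversion": -0.087,
}

def fisher_z_statistic(r, n):
    """Approximate standard-normal statistic for H0: rho = 0 (assumed test)."""
    return math.atanh(r) * math.sqrt(n - 3)

# Two-tailed critical value of the standard normal distribution.
critical = NormalDist().inv_cdf(1 - ALPHA / 2)

significant = {
    name: abs(fisher_z_statistic(r, N)) > critical
    for name, r in correlations.items()
}

for name, r in correlations.items():
    z = fisher_z_statistic(r, N)
    print(f"{name}: r = {r:+.3f}, z = {z:+.2f}, significant = {significant[name]}")
```

Under this approximation the first three correlations exceed the critical value and the extraversion correlation does not, which agrees with the pattern described in the text.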


In conclusion, this study marks a first attempt in reducing mathematics anxiety 
with elementary teachers-in-training through a behaviour therapy. This is of some 
importance because these teachers have significant influence on a particular age group 
at a time when relatively stable attitudes toward mathematics seem to be formed. 
More work needs to be done in developing an understanding of this anxiety and 
developing effective therapies for it, but the results here reported are certainly helpful 
and encouraging. 
Kohlberg’s Moral Judgment Program in the Classroom: 
Practical Considerations 


Donald M. Wonderly 


Kent State University 


Joel H. Kupfersmid 


Sagamore Hills Children’s Psychiatric Hospital 


Kohlberg’s Theoretical Position 


Kohlberg postulates that the term “moral” refers to a judgment or way in which 
one engages in the decision-making process. He notes that this “moral” type of 
decision-making process is confined to the structure or rationale one employs to 
resolve competing claims of priority, such as one’s choice about an individual versus a 
legal right or the right of a person to life versus property jurisdiction. 


Kohlberg claims that how one decides which competing claim has priority 
determines one’s moral maturity level. Kohlberg’s analysis of child and adolescent 
moral judgments made to certain standardized dilemmas has led him to the 
proposition that human beings go through what he described and distinguished as 
three levels of moral judgment with two stages of moral thought incorporated within 
each level. These stages have been presented often. The reader is referred to 
Kohlberg’s earlier works for a detailed summary. 


Moral stage structure and transition follow closely Piaget’s stages of intellectual 
development. That is, certain modes of cognitive organization are necessary, though 
not sufficient, conditions for acquisition of higher levels of moral judgment. Moral 
stage transitions occur under conditions of disequilibrium involving moral conflict. A 
moral conflict is a situation where the claims of two or more individuals are discrepant 
(Kohlberg, 1971a). 
In essence, the conceptualization of developmental stage sequence, as maintained 
by Kohlberg and his colleagues, involves: (a) a prerequisite cognitive organization of 
structural wholeness that, (b) undergoes invariant transitions contingent upon 
maturational and environmental interaction. Simultaneous with cognitive changes are 
transformations of moral judgment when given a diversity of role-taking opportunities. 
Also, (c) the stage position postulates cognitive disequilibrium as a motivational 
condition which the individual attempts to resolve, and (d) resolution of the 
disequilibrium in moral conflict situations results in stage fixation or stage 
progression. 


Kohlberg’s Recommendation to the Schools 


Kohlberg maintains that students are moral philosophers in that they innately 
formulate ideas about morality in terms of organized schemata (Kohlberg & Gilligan, 
1972; Kohlberg & Turiel, 1971). Students, likewise, work in a moral environment, i.e., 
situations where rules, standards, values, and principles are present (Turiel, 1969), and 
educators are constantly engaged in the “hidden curriculum.” This curriculum is one 
designated by the exposition of certain attitudes, values, and principles without actual 
formulation of moral goals and techniques (Kohlberg, 1966, 1970, 1971b). 


The nature of moral education is a process in which moral teaching (i.e., 
classroom discussion aimed at stimulating students’ movements to the next stage of 
moral development) is practised (Kohlberg, 1964, 1967; Kohlberg & Turiel, 1971). The 
ultimate goal of moral education is the development of principled level moral 
judgments (Kohlberg, 1971, 1971b; Kohlberg & Turiel, 1971). 


The techniques involved in moral education include, first, that the teacher have 
knowledge of the student’s present moral stage, that discussion focus on genuine moral 
conflicts conducive to classroom disagreement, and that the argument about the 
morally appropriate resolution of the conflict be stated at one stage above the present 
stage of the students (Kohlberg, 1966; Kohlberg & Turiel, 1971). Second, stimulation 
to higher stages results when the teacher, taking the position one stage above his 
students, focuses on the pupil’s reasoning process and promotes a state of cognitive 
disequilibrium within the child before a suitable alternative solution to the dilemma is 
offered (Kohlberg & Turiel, 1971). A third aspect of moral education is that of peer 
and school participation. It is believed that belonging to a diversity of groups may 
provide greater role-taking opportunity, thus stimulating stage progress. Moreover, 
Kohlberg (1971b) insists, “While participation in the peer group is important, the 
ultimate issue of participation is, of course, the school itself. Here the principle of 
stimulation is by a justice structure one stage above the child’s own” (pp. 283-284). 

Kohlberg makes four basic claims for the appropriateness of promoting higher level 
moral judgments. First, he reports that cross-cultural studies suggest that the stages of 
moral judgment he has delineated are the natural progression of an individual’s 
biological-psychological development. He argues, 


Accordingly, the teacher may take the stimulation of moral development as the aim of 
moral education. Such stimulation of development is not indoctrination; rather it is the 
facilitation of the child’s development through a sequence which is a natural progression 
for him. (Kohlberg & Turiel, 1971, p. 439) 


Secondly, Kohlberg proposes that as one moves through the moral developmental 
stages he is able to deal more effectively and more adaptively with his environment. 
Persons at higher stages also maintain a greater cognitive consistency of beliefs, 
attitudes, and judgments than those at lower judgmental levels (Kohlberg, 1971a; 
Turiel, 1971). 
Third, Kohlberg and Turiel (1971) claim that higher moral reasoning results in 
more desirable behaviours and more consistency in action across situations. 


Persons at a higher level of moral development not only reason better, but they act in 
accordance with their judgments. Experiments demonstrate that principled persons act 
more honestly and live up to their beliefs in the face of inconvenience and authority more 
so than do people or children (sic) at lower stages. (p. 414) 


Fourth, Kohlberg suggests that stage 6 reasoning is philosophically the most viable 
form for resolving competing moral claims: 


Almost all moral philosophers, as well as moral leaders like Abraham Lincoln or Martin 
Luther King, have an ethic based upon an advanced stage of moral development. The 
moral stages do not represent an American middle-class-value bias; they are universal. 
(Kohlberg & Turiel, 1971, p. 414) 


In addition, Kohlberg and Elfenbein (1975) note that “the Stage 6 principle of justice 
as equity is the only intuitively plausible and philosophically defensible test which 
does not suffer from the defects of vagueness, inconsistency, and indeterminacy” 
(p. 639). 

Kohlberg in addition claims that utilizing his curriculum is constitutional because 
“Stimulation of universal stages of moral reasoning is not indoctrination. It violates no 
civil rights and is independent of any religious doctrines or assumptions” (Kohlberg & 
Turiel, 1971, p. 414). Further, Kohlberg (apparently arguing from a postconventional 
perspective) proposes that even if his moral maturity curriculum were 
unconstitutional, it still should be implemented: 


The constitutional issue arises from the point of view of the child’s parent, who can object 
to the teaching of values other than the parent’s own. Respect for the parent’s rights is 
not respect for the child’s autonomy, a more legitimate concern. (Kohlberg, 1971b, p. 72) 


While the authors have taken issue with a considerable number of points in regard 
to both Kohlberg’s theoretical orientation and to his claims of empirical support in 
other places, the current discussion will focus on only one issue: the practicability of 
implementing Kohlberg’s moral discussion curriculum in the school. 


The Feasibility of Kohlberg’s Curriculum 


Kohlberg’s moral maturity program is based on moral discussions taking place in 
the classroom and on a school-wide basis, with the aim of stimulating students’ moral 
stage progression until a postconventional structure of moral conflict resolution is 
realized. Specifically, Kohlberg’s program requires that: 


1. Teachers have knowledge of their students’ present moral stage. 
2. Classroom discussion focus on genuine moral conflicts. 


3. The teacher promote cognitive disequilibrium within the student as a means to 
foster stage transition. 


4. Resolution of a particular moral conflict be presented by the teacher utilizing 
moral reasoning one stage above his students’ present functioning. 


5. Students be encouraged to engage in many peer participatory activities. 


6. Ultimately, the entire school be operated such that students, teachers, and 
administrators engage in mutual and participatory moral conflict resolutions. 
It is the purpose of the present discussion, thus, to explore each of the above 
points in some detail. The goal is to determine if the application of Kohlberg’s 
curriculum to the classroom and school appears viable given the current state of the 
educational, as well as the socio-political, communities. 

Teachers must have knowledge of their students’ present moral stage. In order to 
achieve this goal, teachers would first have to be trained (either in the universities 
and/or through inservice training) by persons expert in Kohlbergian theory. The 
training of school teachers in identifying student moral stage responses would require 
a considerable number of training hours, given the experiences of these authors and 
some of their colleagues. 


In order for the teacher to lead a discussion on moral issues effectively, the ability 
to process quickly a student’s response for immediate moral stage estimation would be 
required. The ability of the majority of teachers to organize, process, and respond 
appropriately is suspect. 

Rest (1974) states: 


Judging from the amount of time it takes to stage score a regular interview and the 
amount of time it takes to compose unambiguous stage-prototypic statements, it is 
unrealistic to expect a teacher to be computing +1 retorts to students in a group 
discussion. (pp. 248-249) 


A doctoral dissertation by Napier (1975) tested teachers’ ability to code correctly 
subjects’ statements made to Kohlberg’s Moral Judgment Scale, using Kohlberg’s 
scoring manual (rather than sheer recall) as their guide. Sixty-seven elementary school 
teachers served as subjects. Napier concluded, “overall subjects could not correctly or 
accurately assess moral statements” (p. 5201A). 

A second implication with respect to teachers’ ability to note the moral maturity of 
their students appears to be that teachers would have to administer and score 
students’ responses to Kohlberg’s Moral Judgment Scale at least in the junior and 
senior high school. (In the elementary grades a preconventional level may be 
assumed.) Group administration and written responses to the Scale could be 
undertaken in the high school, provided students’ reading and writing ability was not 
a problem. For nonreaders, individual oral administration for each student would be 
required. It has been our experience that oral administration of the scale takes 
approximately one to one-and-a-half hours per individual. 

Additional time considerations are involved even when tests are not individually 
administered. Someone must score each moral protocol, at least in the high school, in 
order to determine the students’ probable moral maturity. Thus (even ignoring 
Napier’s results or Rest’s comments) time considerations appear enormous, especially 
at the junior and senior high school levels. It has been the junior author’s experience 
that a trained scorer who has considerable practice in scoring moral maturity protocols 
requires about an hour of scoring time per individual. For junior and senior high 
school teachers teaching five courses per day with 30 pupils per course, 150 hours 
would be consumed in scoring students’ responses. 
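
The 150-hour figure follows directly from the numbers just given. The sketch below reproduces the arithmetic. The course load, class size, and hour-per-protocol estimate come from the text; the 40-hour working week used to express the total in weeks is a hypothetical assumption, and the names are illustrative only.

```python
# Figures taken from the text.
COURSES_PER_DAY = 5
PUPILS_PER_COURSE = 30
SCORING_HOURS_PER_PROTOCOL = 1.0

# Hypothetical figure, not from the text: a 40-hour working week.
HOURS_PER_WORK_WEEK = 40

def total_scoring_hours(courses, pupils_per_course, hours_per_protocol):
    """Hours one teacher would need to score one protocol per pupil."""
    return courses * pupils_per_course * hours_per_protocol

hours = total_scoring_hours(
    COURSES_PER_DAY, PUPILS_PER_COURSE, SCORING_HOURS_PER_PROTOCOL
)
weeks = hours / HOURS_PER_WORK_WEEK

print(f"Protocols to score: {COURSES_PER_DAY * PUPILS_PER_COURSE}")
print(f"Scoring time: {hours:.0f} hours, about {weeks:.2f} full working weeks")
```

On these assumptions, a single round of scoring would occupy one teacher for nearly four full working weeks, before any time spent administering the Scale.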

Third, it must be noted that Kohlberg is constantly revising his scoring manual, 
thus making each succeeding year’s edition somewhat obsolete. The cost of the scoring 
manual (reported at approximately $40) may also tend to discourage its widespread 
use and dissemination to the schools. 
Fourth, in order for a teacher to respond appropriately (in the Kohlbergian sense, 
one stage above the student’s present level of functioning), the teacher must have the 
prerequisite cognitive and moral sophistication. In the elementary school this may not 
be a serious problem. However, the teacher’s cognitive capacity and moral judgmental 
level become critical at the high school level where many students function at a stage 
4 mode. Responding adequately to stage 4 reasoning would require the teacher to 
employ a stage 5 response. In addition, a prerequisite to stage 5 
decision-making includes the capacity to reason at the formal operations level. 
According to Kohlberg, approximately 50% of the American population reaches formal 
operations (Kohlberg & Gilligan, 1972) and, currently, only 25% of the population is 
operating above a stage 4 morality (Kohlberg, 1971a). 


Granted that teachers are perhaps above the “normal” population on the average, 
there remains serious doubt that the school could maintain a teaching staff if 
principled moral judgments became a precondition to being hired. The ACLU, 
teachers’ unions, and other organizations are hardly apt to allow for the introduction 
of such a stipulation! 


Proponents of Kohlberg’s program might argue that teachers could be trained 
(through his moral curriculum) to resolve moral conflicts in terms of postconventional 
criteria. However, such training would probably cost more in time and money 
than waiting for “principled thinkers” to appear at the school doorstep with teaching 
credentials. Kohlberg’s means of resolving this problem thus far, in his “just 
community” project, is to assign six teachers plus two consultants for 35 students. 
There is some discrepancy on the number of students in the just community; Muson 
(1979) reports 35, whereas Power and Reimer (1977) report 65 students. Of these six 
teachers, two have Ph.D.’s and one is currently a doctoral student (Muson, 1979). 
Kohlberg’s resolution to the problem of teachers’ cognitive and moral sophistication is 
hardly viable for public education. 


Classroom discussion should focus on genuine moral conflicts. The difficulty with 
this aspect of the moral maturity curriculum seems to be related to precisely what 
Kohlberg means by a genuine moral conflict. He suggests that a genuine moral conflict 
results when “disagreement about problematic situations” arises (Kohlberg & Turiel, 
1971, p. 416). To Galbraith and Jones (1975): 


A good moral dilemma should have three characteristics. It should present a real conflict 
for the central character; it should include a number of moral issues for consideration; and 
it should generate differences of opinion among students about the appropriate response 
to the situation. (p. 18) 


The unresolved issues involve the apparent subjective nature of an arousing 
situation (in addition to specifying in detail what a real conflict for the central 
character is). Given the variability among students (especially in the junior high school 
years and beyond) in cognitive and moral sophistication, a particular dilemma may be 
genuine for one student but not another. 


In light of the variety of stages at which pupils may be functioning in a particular 
classroom (especially at the junior and senior high school levels), a dilemma presented 
for classroom discussion may be genuine (i.e., conflict-arousing) for some individuals at 
a specific stage of moral development but not perceived as a genuine conflict for those 
at lower or higher stages. Thus, the student functioning at a lower stage may not be 
able to appreciate the various competing claims inherent in the dilemma, whereas the 
student at a higher stage may consider the resolution to the dilemma obvious. 
More recently, Kohlberg and Hersh (1977) have amplified the Galbraith and Jones 
description: 


What can teachers and schools do to stimulate moral development? The teacher must 
help the student to consider genuine moral conflicts, think about reasoning he uses in 
solving such conflicts, see inconsistencies and inadequacies in his way of thinking and find 
ways of resolving them. (p. 57, italics added) 


Precisely what makes a moral conflict genuine, and how teachers can assist 
students in conflict resolution, still seems far from clear. 


Teachers should promote cognitive disequilibrium within the students as a means 
to promote stage transition. The genuineness of a moral dilemma is based on its 
arousal potential for promoting cognitive disequilibrium. However, like genuineness of 
a moral dilemma, the exact nature of cognitive disequilibrium appears somewhat 
vague. 


It is difficult to be sure what Kohlberg is asserting because he gives no account of 
what he means by “teaching” or by “cognitive stimulation,” with which he contrasts 
teaching (Peters, 1971, p. 243). Even if teachers were presented with a clear 
description and had the ability to recognize a cognitively disequilibrated individual, 
there would be no assurance that the method used to create cognitive disequilibrium 
for an individual at one moral stage would be appropriate for an individual at another 
moral stage (Weinreich, 1974). 


Third, evidence for disequilibrium as a precondition for stage transition has not 
been provided. Two empirical investigations have failed to find support for the 
contention that those identified as disequilibrated are in fact experiencing such a state 
(Sullivan & Quarter, 1971; Wonderly & Kupfersmid, 1978). Aron (1977) notes, 
“Kohlberg pays lip service to the term ‘equilibration,’ but he offers no criteria by 
which one can determine whether or not an individual is actually undergoing this 
process” (p. 212). 

Lastly, even if a clear description of cognitive disequilibrium may be offered and 
evidence is produced which indicates that disequilibrium can be generated for each 
moral judgmental stage and that disequilibrium will result in stage transition, there 
still remains the problem of providing teachers with viable techniques to promote the 
disequilibrium. To date, Kohlberg appears to have omitted presenting the schools with 
the precise means by which teachers can generate such disequilibrium. 

Teachers should provide resolution to moral conflicts utilizing moral reasoning 
one stage above their students’ present functioning. Problems with this aspect of 
Kohlberg’s program have already been noted. As was stated by Rest (1974), it is 
unreasonable to assume that teachers can quickly and accurately provide +1 
responses in group seminars. Rest also noted that responding one stage above that of 
one’s students would probably not be very beneficial in fostering moral stage 
progression, anyway: 


There are two reasons to suppose that +1 modeling is not essential: first, cognitive 
developmentalists, including Piaget, usually describe the acquisition of new structures in 
terms of the single individual trying to reorganize his thoughts, not learning from a model; 
and second, new cognitive structures can be evolved in the absence of higher stage models 
(Who was +1 for Socrates?). (pp. 245-246) 
The problem of a student understanding a response one stage above his current 
functioning has not been adequately resolved by Kohlberg. This is especially apparent 
when a +1 response requires a more advanced level of cognitive sophistication (such 
as, providing a stage 3 response to a stage 2 child, or offering a stage 5 retort to one 
functioning at a stage 4 rationale). 

Another apparent problem with utilizing a +1 response to resolve moral issues in 
the classroom may be offered. In a typical high school class, one supposes that the 
spread of moral developmental variability approximates the normal curve depicted in 
Table 1. 


TABLE 1 
THE MORAL JUDGMENT DISPERSION 
OF A HYPOTHETICAL HIGH SCHOOL CLASSROOM 


Moral Stage       1        2       3       4       5        6 
Frequency       2.3%    13.5%    34%     34%    13.5%    2.3% 


It appears that the +1 resolution offered by the teacher under such conditions 
may not produce the desired outcome (i.e., stage progression). As an example, the 
teacher is first faced with which moral stage reflects the “norm” of the classroom. If 
the teacher decides to resolve the presented dilemma by utilizing a response one stage 
above those arguing at a stage 3 rationale, then he may run the risk of: (a) 
re-affirming those who are maintaining a stage 4 position that they are “right,” (b) 
shaking the reasoning of those who are currently in a transition from stage 4 to stage 
5, (c) perhaps alienating, weakening, or losing the confidence of those students 
currently functioning at a principled level, and (d) stating a resolution that is too 
advanced for those students operating at moral stages 1 and 2. Thus, at best, the 
teacher has only potentially influenced those maintaining a stage 3 orientation. This 
situation does not improve if another stage is selected from which a +1 resolution will 
be offered when the classroom approximates the above moral maturity distribution. 
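
The point can be made concrete with the Table 1 percentages. The sketch below tabulates, for each stage a teacher might target, the share of the hypothetical class at that stage (the only students a +1 response is matched to) and the share at every other stage. It is a rough illustration under the simplifying assumption that a +1 response can benefit only students exactly one stage below it; the function and variable names are illustrative.

```python
# Percent of the class at each moral stage, as given in Table 1.
TABLE_1 = {1: 2.3, 2: 13.5, 3: 34.0, 4: 34.0, 5: 13.5, 6: 2.3}

def plus_one_reach(target_stage, distribution):
    """Return (percent matched, percent not matched) for a response
    pitched one stage above target_stage."""
    matched = distribution[target_stage]
    not_matched = sum(distribution.values()) - matched
    return matched, not_matched

# Stage 6 is omitted as a target: no stage 7 exists to pitch a response at.
reach = {stage: plus_one_reach(stage, TABLE_1) for stage in range(1, 6)}
best_matched = max(matched for matched, _ in reach.values())

for stage, (matched, not_matched) in reach.items():
    print(f"Target stage {stage}: matched {matched:.1f}%, "
          f"not matched {not_matched:.1f}%")
print(f"Largest share any single +1 response is matched to: {best_matched:.1f}%")
```

Whichever stage is chosen, the response is matched to at most about a third of the class, which is the sense in which selecting a different target stage does not improve the situation.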
Aron (1977) raises another critical issue: 
It seems plausible to assume that the discussion leader, having a clear idea of the higher 
stage which the participants are supposed to attain, may give the members of the group a 
variety of cues, conscious or unconscious, which indicate which solution or statement of 
justification is the “correct” one. Successful students are quite practical in detecting such 
cues. Thus, it may be that the participants in the moral discussion group do not actually 
experience any cognitive reorganization, but simply learn to give those answers which 
satisfy the discussion leader, and which earn them higher scores on the posttest. (p. 211) 


Students should be encouraged to engage in many peer participatory activities. 
While most educators, including the authors, would not object to students engaging in 
many group activities, there is some concern in how to implement such a 
recommendation in the school and classroom. 


The first concern focuses on specifically what kind of peer participatory activities 
Kohlberg is suggesting as a means of promoting moral stage progress. A review of the 
literature suggests that he has not delineated the appropriate kinds of activities. 
Surely Kohlberg is not suggesting that those who participate in youth street gangs 
should be encouraged to continue such participation. 


An allied difficulty is the age-related nature of peer participation. There needs also 
to be a delineation of peer activities appropriate for moral development for various 
age groups. The contention is that a peer participatory activity conducive to moral 
development for a nine-year-old would not necessarily be a fruitful activity in 
promoting moral progression for a nineteen-year-old. Similarly, group activities that 
may foster moral transition for males at various ages may be different than the kinds 
of activities which would encourage moral development in females. Likewise, peer 
activities that would, for example, promote transition from a stage 3 to a stage 4 
moral perspective would not necessarily promote moral development for those 
progressing from stage 4 to stage 5. 


A third difficulty that arises in promoting peer participation is the extent to 
which teachers are currently trained in this area. Teacher training programs usually do 
not provide instruction in how to be a group leader, how to facilitate positive peer 
interaction, and the dynamics of peer groups at various chronological ages. 

Ultimately, the entire school should be operated on a participatory basis for 
moral conflict resolution. Two basic difficulties, in addition to those already 
suggested, appear with respect to the implementation of a school-wide Kohlbergian 
program. 

First, acceptance and satisfactory implementation of such a program seem to be 
predicated on the ability of principled level moral decision-making by the majority of 
teachers, administrators, and school board members. This condition seems hardly 
possible given the current estimate of postconventional individuals reported by 
Kohlberg. Unless Kohlberg is suggesting that all educators first receive moral training 
by his colleagues (which would increase the financial and time costs of his program 
considerably), the potential for a school-wide moral maturity curriculum, at present, 
seems unrealistic. Sullivan (1977) outlines his approach to utilizing Kohlberg’s program 
in the Tacoma Public Schools. The cooperation of teachers, principals, superintendent, 
board of education and community members was required prior to project 
initiation. While Sullivan does not provide the reader with a time-table, his 
description suggests one to two years planning prior to any significant classroom 
implementation. Even then, the results of the program after classroom implementation 
are not reported. Peters (1971) offers a more viable strategy: 


He (Kohlberg) does not take “good-boy” morality seriously enough either from a practical 
or from a theoretical point of view. Practically speaking, since few are likely to emerge 
beyond Kohlberg’s Stages 3 and 4, it is important that our fellow citizens should be well 
bedded down at one or the other of these stages. (p. 678) 


Hamm (1977) concurs, stating that teaching the “bag of virtues” Kohlberg disclaims 
is very satisfactory for the majority of citizens. 


A second major difficulty with implementing a Kohlbergian program at the 
school-wide level seems to be that of variability in the moral stages of the crucial 
participants, namely, the students. Even if all pedagogists functioned at stage 6, what 
moral stage would be utilized to promote moral maturity, when meeting to discuss 
school-wide concerns, among individuals spanning 13 years and all moral stages? Orr 
(1974) speaks to this issue: 


There are many points at which Kohlberg has simply not taken the time to refine his 
proposal for a restructured school (e.g., he has not dealt with the implications of 
cognitive-social development for age segregation in various forms of decision-making 
within the school). (p. 370) 


Serious doubts as to the feasibility of utilizing Kohlberg’s program have been 
offered for each step of the implementation phase. There remains one major area to 
be discussed: the results of those who have tried to promote higher moral reasoning 
within the schools. While the arguments offered may appear to be reasonable 
deterrents, there is the possibility that investigators may have overcome some of the 
obstacles sufficiently (at least on a small scale) to have promoted the moral maturity 
of students. 

A brief review of the most available literature reveals two basic approaches. One 
has been to employ a Kohlbergian curriculum in the classroom for a specified period. 
The second has been the utilization of potential components of a Kohlbergian program 
such as role-playing, teaching moral philosophy, and lecturing on Kohlberg’s theory 
and supporting research. 

It should be noted that, with the exception of Kohlberg’s experimental attempts at 
a just community, most moral maturity programs have only been implemented in one 
or two classrooms for several weeks or a few months. However, Kohlberg has not 
specified his program sufficiently to mitigate the variability among studies with 
respect to program implementation. Also, the age of program participants is variable 
across studies, which makes generalizations precarious. 

With respect to the literature reviewed utilizing discussion of moral dilemmas as a 
treatment basis, the outcomes are almost equally divided. A summary of these studies 
is provided in Table 2. 


When interpreting Table 2, “support” indicates that there was a statistically 
significant difference in at least one of the following conditions: (a) the individuals 
engaged in discussion of moral dilemmas increased their moral maturity scores from 
pretest to posttest, or (b) the treatment group’s moral maturity score differed 
significantly from the control’s score at the posttest. “No support” would indicate that 
the above criteria were not met. 
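
The "almost equally divided" outcome can be verified by tallying the Results column of Table 2. The sketch below does so; the entries are transcribed from Table 2, and the list and variable names are illustrative only.

```python
from collections import Counter

# (investigator, result) pairs transcribed from Table 2.
table_2 = [
    ("Blatt (in Kohlberg & Turiel, 1971)", "Support"),
    ("Paolitto (1976)", "Support"),
    ("Kavanagh (1975)", "Support"),
    ("Keefe (1975)", "Support"),
    ("Grant (1975)", "No support"),
    ("Williams (1974)", "No support"),
    ("Enderle (1974)", "No support"),
    ("Bong (1975)", "Support"),
    ("Turiel (1966)", "No support"),
    ("Turiel & Rothman (1972)", "No support"),
    ("Tracy & Cross", "Support"),
]

# Count how many studies fall under each outcome label.
tally = Counter(result for _, result in table_2)

for result, count in tally.items():
    print(f"{result}: {count} of {len(table_2)} studies")
```

Of the eleven studies listed, six meet the "support" criteria and five do not.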


Other investigators have attempted to raise the level of moral judgment in a 
variety of ways. Table 3 presents a brief review of the various means researchers have 
employed to promote moral maturity. 
TABLE 2

SUMMARY OF RESEARCH UTILIZING MORAL DISCUSSION
TO PROMOTE MORAL MATURITY

Investigator              Subjects                    Length of Treatment                           Results

Blatt (in Kohlberg &      12 - 11 & 12 yr. old        One weekly meeting for 12 wks. (duration      Support
Turiel, 1971)             Jewish children             of meeting not provided)

Paolitto (1976)           17 - 8th graders from       Not stated                                    Support
                          disadvantaged environment

Kavanagh (1975)           96 high schoolers           10 wks. (no further information provided)     Support

Keefe (1975)              70 - 8th graders            50 min. sessions, once/wk. for 10 wks.        Support

Grant (1975)              50 clinical pastoral        1.5 hrs/session (no. of sessions not          No support
                          students                    provided)

Williams (1974)           24 - 7-9 yr. olds           3 weekly sessions for 20 min./session for     No support
                                                      8 wks.

Enderle (1974)            23 educators                daily meetings for 1 wk, 3 hours/day          No support

Bong (1975)               45 - 6th graders            30 min./day 3 days/wk. for 10 wks.            Support

Turiel (1966)             44 - 7th grade males        not stated, but presumably 1 to 2 hrs per     No support
                                                      subject

Turiel & Rothman (1972)   43 - 7th & 8th graders      not stated, presumably 1 hr./subject          No support

Tracy & Cross             76 - 7th grade males        not stated, presumably about 1 hr/subject     Support


TABLE 3

SUMMARY OF NON-MORAL DISCUSSION METHODS
USED TO PROMOTE MORAL MATURITY

Method Used to Promote    Length of Treatment       Subjects                Results                           Investigator
Moral Maturity

Course in Ethics          Not stated                24 undergraduates       no change in moral maturity       St. Clair (1975)

Course in Sociology       Not stated                24 undergraduates       no change in moral maturity       St. Clair (1975)

Non-Directive role-taking 2 sessions/wk. for 2 mos. 5th graders             increase in moral maturity        Travis (1974)

Role-playing              55 min. sessions, 2/wk.   55 - 5th graders        no difference in moral maturity   Strong (1975)
                          for 10 wks.                                       between groups

Self Awareness Group      same as above             same as above           same as above

Decision-making Group     same as above             same as above           same as above

Democratic child-rearing  2.5 hrs./wk. for 10 wks.  10 parents & their      no difference in moral maturity   Stanley (1976)
training                                            adolescent children

Democratic child-rearing  same as above             12 parents              same as above                     Stanley (1976)
training

Interpersonal skills      3 hrs./wk. (duration not  high school students    no change in moral maturity from  Sprinthall (1972)
training                  provided)                                         pretest

Peer counseling or peer   not stated                same as above           same as above                     Sprinthall (1972)
tutoring
The results generally indicate that means other than moral discussion are even less 
effective in promoting moral maturity. Thus, the contention that implementation of 
Kohlberg’s curriculum in the schools seems unfeasible at present appears to be borne 
out by empirical findings. 

Kohlberg’s two projects attempting to create a just community have
purposely been omitted thus far. The results of these two programs are currently quite
controversial; therefore, inclusion as support or nonsupport might be misleading.


The most recent analysis of Kohlberg’s attempt at a just community in the prisons 
and schools is offered by Feldman (in Muson, 1979) and is soon to be published. With 
respect to the prison community, Feldman notes that there was little discussion of 
justice at group meetings and many community rules were handed down by 
administrators. Likewise: 


Feldman both examined the results of the moral-judgment interviews with the inmates 
undertaken by Kohlberg’s associates and conducted interviews of his own. He concluded 
that the program did nothing to raise levels of moral reasoning or to lower recidivism 
rates. He charges that those scorers who did report improvement often knew the dates of 
the interviews — which could bias them in judging changes over time — and that they were 
under pressure from one of the consultants of the program to produce positive results. 
(Muson, 1979, p. 58) 


Feldman’s evaluation of the just community in the high schools is also pessimistic. 
He contends, “one senior administrator used the term ‘blackmail’ to describe how the
staff had gotten the community to agree not to permit pot-smoking, a politically
sensitive issue, at a school event” (Muson, 1979, p. 68). Further, “Feldman also
interviewed senior school administrators in Cambridge about the high school project 
and concluded the democracy in all three programs was a sham” (Muson, 1979, P. #51). 
Power and Reimer’s (1978) description of the just community has a more optimistic 
flavour, but their report does not contradict Feldman’s conclusions, nor are there data 
provided which would suggest after three years that the just community is making 
moral or behavioural advancement. 


The specifics of Kohlberg’s program in a variety of areas seem lacking. The 
expectation placed on school personnel necessary for successful implementation 
appears unrealistic, and the current research data suggest that one’s moral maturity is 
not readily subject to alteration. It is tempting to propose that further study be done 
of the efficacy of such programming. Unfortunately, results to date suggest that there 
is serious reason to question the legitimacy of any further investment of energy. Once 
again, we seem to have discovered a parameter of man that invites investigation. The 
fact that it doesn’t stand the test rarely discourages continued conviction as to its 
validity. 


Requests for reprints should be sent to D. M. Wonderly, Department of Counseling and Personnel Services, Kent State 
University, Kent, Ohio, 44242. 
RESEARCH NOTES 


IMPLICATIONS OF THE RIGHT TO EDUCATION 
LEGISLATION ON EDUCATIONAL RESEARCH 


One of the goals of the International Year of the Child was to ensure that all 
children have a Right to Education. Until recently, children in the United States, with
the exception of many handicapped children, did have access to public
education. In 1975, the United States Congress passed legislation, the Education for
All Handicapped Children Act, which now guarantees that every child in the U.S. will
have a Right to Education. The U.S. law, Public Law 94-142, requires that
handicapped children receive an “appropriate” education in the least restrictive 
educational environment. An appropriate education is defined according to currently 
accepted educational methodology. This U.S. legislation even includes a statement 
concerning educational research: that state and local school districts should initiate 
research related to the development or improvement of special education programs. 
However, some educational research may contravene the intent of this law, and thus 
specific aspects of the legislation could inhibit or confound certain research designs. 


First, Public Law 94-142 states that all children identified as requiring special 
education must be placed in an appropriate educational program immediately. Any 
delay in the placement of a child into an appropriate program following the 
identification of special educational needs may serve as the basis for litigation against 
the school or school district. Since programming must begin immediately, any design 
which uses identified but unserved exceptional children as control subjects is no longer 
acceptable. 

Second, the program for an exceptional child is designed using a team approach 
with the parents included as part of that team. Deviations from the goals, objectives, 
procedures, and methods indicated in the program are not acceptable. No longer can 
exceptional children be included in a research project that is not a part of their 
designed educational program. For example, a research design which uses exceptional 
children as a comparison group matched on CA or MA to normal children may no 
longer be allowed if the involvement of the exceptional child interferes with or has no 
bearing on the established appropriate educational program. 
Third, research related to systematic development of educational programs is 
limited by this legislation. While the federal law indicates that new, innovative 
programs should be developed, in a “Catch 22” scheme the law limits or restricts 
research related to the empirical validation of any newly developed program. The law 
does so by stating that the child must be placed in an appropriate educational 
program. If the child is placed in a different program (an experimental program) the 
merits of that program must be explained to demonstrate that the new program is 
appropriate and is, according to currently accepted methodology, as adequate as the 
old program. The “Catch 22” aspect of this exercise is that until the experimental 
program is empirically validated, the merits of that program cannot be confirmed. 
Thus, the use of a contrast research design in which some students receiving an old or 
established program are contrasted with matched students receiving a new program 
may be challenged and serve as the source of litigation against the school or school 
district. Since the burden of establishing the efficacy of a new program rests with the 
individual States, most States are reluctant to initiate research which may result in 
legal action against that State’s educational system. It is now easier for individual 
States to “pick-up” a program from elsewhere rather than initiate special education 
research and go out on a limb. 

Right to Education Laws to include all children, similar to the U.S. federal
legislation, have been discussed in many nations at various governmental levels. In
Canada the apparent reluctance of provincial assemblies to move on a Right to 
Education Law suggests that legislation as comprehensive as that in the U.S. is not 
likely. However, a modified form of Right to Education Legislation should soon be 
passed in some provinces. The effect of Right to Education legislation on educational 
research in any country where such laws exist is an important issue. 

The indirect effect of Public Law 94-142 in the U.S. has been to control and/or 
reduce educational research related to exceptional children. While the U.S. legislation 
was not directly aimed at controlling educational research, the effect was achieved. 
Government has the ability to direct and control research by the manner in which 
research funds are allocated as well as by direct legislation. 

The issue in relation to educational research revolves around the relevance of and 
need for educational research and the freedom to choose a topic of concern and an 
appropriate research design as opposed to choosing a research topic and design which 
will likely be free of governmental and legal involvement. In Public Law 94-142 the 
funding issue is secondary to the restriction placed on educational research by the 
constant threat of litigation. While this law presently restricts educational research 
related to special education students, there is a possibility that similar legislation may 
emerge for all school age children with similar limitations related to educational 
research. The questions raised by Public Law 94-142 are as follows: 


1. Is there an actual need for educational research in any or all of the education 
spectrums? 

2. Is government competent in selecting relevant topics and directing appropriate 
educational research? 

3. Should there exist the right to carry out research in accordance with theoretical 
or empirical dictates as opposed to government law and doctrine? 

4. To what extent is legislation such as Public Law 94-142 setting a precedent for
the future restriction or control of educational research?

Charles Norman
Department of Educational Psychology 
The University of Alberta 
BOOK REVIEWS 


ESSENTIALS OF EDUCATIONAL MEASUREMENT (3rd ed.). 
By R. L. Ebel. Englewood Cliffs, N.J.: Prentice-Hall, Inc., 1979, 388 pp. 


Textbooks about the principles of measurement and evaluation in teaching are 
abundant. Assuming that students spend two to three times as many hours with their 
books as with professors, prudent selection from the pool of available texts is 
warranted. In my judgment, at least four major factors should enter into 
decision-making about selection: content covered, bias in recommendations about the 
application of principles, clarity of presentation, and provision for students to practise 
the principles. Whereas textbooks should vary on the first two of these factors to 
accommodate different professors’ values and unique purposes of a course on 
measurement within an overall program of teacher education, no text should falter in 
the latter two areas. 


Ebel’s third edition of Essentials of Educational Measurement is titled accurately. 
The book definitely sticks to educational measurement, as opposed to the broader 
domain of psychological measurement, and it does not venture much beyond 
essentials. Ebel opens with short chapters about the present status of educational 
measurement and its roles in the processes of education. The next 13 chapters deal 
with principles for constructing, administering, scoring, evaluating, improving, and 
communicating the results of classroom-based paper-and-pencil tests of achievement. 
The traditional item formats (essay, true-false, multiple-choice, short-answer, and 
matching) are covered through a rule-example-discussion presentation which almost 
always pairs a good with a bad illustration of preferred practice that is then examined 
in the discussion. Following these chapters, Ebel turns to the simpler statistical 
principles underlying scoring, item analysis, and reliability. The emphasis in these 
chapters is on improving measuring procedures used in classrooms, and the 
presentation again follows a rule-example-discussion format. Following a short chapter 
about validity regarding teacher-made tests, four quite brief chapters examine 
standardized achievement tests, standard scores and norms, intelligence and aptitude 
tests, and personality and attitude measures. The depth and breadth of discussion 
throughout can be estimated to some extent by this book’s 375-page length compared 
to others that typically take 500 to 650 pages to cover these same areas. Among topics 
sometimes covered in other books but omitted or analyzed only very briefly in Ebel’s 
text are: observational procedures, design and evaluation of work samples such as 
individual projects, sociometric and other procedures for analyzing classroom 
environments, and special assessment procedures such as those involved in case 
studies or for study of exceptional learners (e.g., learning-disabled students). These 
omissions highlight Ebel’s focus on essentials for measuring classroom-relevant 
achievement, and certainly reflect procedures that most agree should be prominently 
used by teachers in today’s schools. 


Ebel’s bias about measurement techniques and their use in teaching is never in 
doubt. Four very clear messages are presented throughout the book. First, Ebel finds 
little value in jumping on the criterion-referenced test bandwagon, and states flatly 
that item analysis and reliability statistics do not need to be tailored specially to be 
used validly in such tests. Second, perhaps the most clarion note throughout the text 
is that teachers should not build a test, use it, and then discard it or leave it 
untouched until next year. Only by serious and skilled efforts to improve successive 
versions of test items will measurement in classrooms achieve high-quality, 
professional status. The third bias found in the book may grate on those who cherish 
traditional conceptions about content, criterion, and construct validity. According to 
Ebel, validity in classroom tests is a function of clear specifications of test content 
achieved via operational descriptions of the tasks it contains, the test’s reliability, and 
whether the test score is used in a rationally defensible manner. Finally, the chapter 
on measuring personality, attitudes, and interests communicates strongly that 
classroom teachers very likely will not profit by using paper-and-pencil tests to gauge 
these factors. 


Turning to the matter of clarity, Ebel is to be praised. His descriptions and 
explanations of principles, illustrations of their application, and rationales about the 
practice of sound measuring procedures in classrooms are excellent. Each chapter ends 
with a terse but complete set of summary statements that really do summarize the 
preceding content. Cross-references within the text are explicit, and a glossary of 
terms will aid students who study it. Users of the book will want to scrutinize 
formulae for occasional misprints of subscripts, and the table for calculating a KR-20 
coefficient will need explanation before students read it. A second printing should 
remove these very minor problems. Ebel succeeds admirably in promoting attention to 
and comprehension of tables, figures, and formulae by clearly describing their content 
in text. The illustrations of how to calculate item statistics and test statistics are 
superior. 

For practically everyone, mere reading does not ensure the ability to apply 
correctly and easily the principles of educational measurement. The projects and 
problems following most chapters are very brief and sometimes unrepresentatively 
easy. For computational problems, the instructor will need to calculate the answers. 
Students likely will require more practice to achieve competence in applying what 
they know. 

This book, like any other, will meet fully some professors’ needs and fall short of 
others’. As stated in the preface, it clearly is a book for an introductory course. I 
would encourage student teachers to keep the text after the course as a guide to good 
practice, especially if additional assignments were developed to extend the range of 
practice. Teachers returning for course work after some years of practice probably 
warrant a more comprehensive treatment than this book provides. Nonetheless, for 
courses that intend to teach essentials, Ebel’s text certainly should be among those in 
the pool of candidates for adoption. 


Philip H. Winne
Faculty of Education
Simon Fraser University
CHILDREN’S RIGHTS: LEGAL AND EDUCATIONAL ISSUES. By Heather 
Berkeley, Chad Gaffield and W. Gordon West (Eds.) Toronto, Ontario: Symposium 
Series 9, 1978, 177 pp. (ISBN 0-7744-0169-9) 

The most obvious question to ask about a book under review is, “For
whom was the book intended?” After reading Children’s Rights, this reviewer was
forced to the conclusion that this book of twelve essays could only be read with 
reasonable profit by those people who had the following kind of background: 


1. A reasonable understanding of the history of education in Western Europe and 
Ontario. 


2. Some knowledge of the history of juvenile delinquency and child care legislation 
in Ontario. 


3. A good knowledge of law and legal principles as these are found in Ontario and 
in the American states. 


4. A special interest in attempting to bring about social change through 
“experimental” means. 


The above make up a formidable array of requirements for any reader. It would be 
better, instead, to look at Children’s Rights as a collection of essays with strong value 
only in a few of the essays for any one particular reader. 

The theme of Children’s Rights is the lamentable lack of children’s rights, 
especially in statute law, in Canada and the United States. 


One of the most thoughtful essays in Children’s Rights is the work prepared by 
Magsino. In his essay there is clear recognition of the enormously complex conditions 
of mass education — hundreds of thousands of teachers dealing with millions of 
children of various ages and abilities. As a result of recognizing this complexity, 
Magsino does not swing wildly in the direction of a statutory Bill of Rights which 
would offer all children the adult rights, for example, of freedom of speech and 
assembly; nor does he swing in the other direction by classing all children as being so 
immature and witless as to deserve deprivation of any rights until they reach the 
magic age of eighteen. Magsino takes a middle course between the “legalistic” and the 
“care” concepts of dealing with children. 

Magsino divides his lists of rights into option rights in which the emphasis is upon 
individual freedom, and welfare rights which have their emphasis upon equality. In 
connection with these rights, Magsino offers interesting observations in support of his 
proposed categories and their contents. These observations or “defence” are an 
important part of the article. 

It is fair to point out one grievous error that Magsino makes, and that is his
assumption that the Canadian Bill of Rights might well have been expected to provide
“judicial effectiveness in preserving various individual civil and political rights.” The
answer is that the Canadian Bill of Rights, of course, applies to federal law and
jurisdiction only, and the provinces are not affected. At one point in his article,
Magsino discusses the entrenchment of a Bill of Rights in a new Canadian
constitution and at that point, he seems to understand that a provincial Bill of Rights
might be drawn into the national bill only upon major constitutional change. However,
previous comments about the federal Bill of Rights will have served to confuse the 
reader, especially when Magsino refers to the cases of Drybones, Lavell, and Bedard in 
relation to the federal Bill. 


A second article worth reading is Manley-Casimir’s, about the topic of 
discretionary justice in schools. This article makes a good companion piece for the 
work done by Magsino, because Manley-Casimir explores most carefully what 
“discretionary” and “justice” mean for teachers and school administrators.
Manley-Casimir does not advocate some form of due process for students in which a
school principal holds a kind of trial with witnesses, evidence, advocates, and
defenders (and all the other paraphernalia of natural justice) because Casimir seems to
recognize, as the American Supreme Court did not, that it is well-nigh impossible to
bring natural justice into play every time a student commits some offence against
school policy. But Casimir does recognize the enormous power that schools have over 
students and he writes well about the great need to confine, structure, and check 
discretionary power. 


Casimir’s article contains two interesting case studies of a man and a woman 
within a particular Chicago School. The male teacher, who was in charge of the 
discipline of boys, was twice as likely to suspend a boy from school for an offence as 
the woman was to suspend a girl for a similar offence. The case materials show that 
the man was greatly concerned with order and control, while the woman was far more 
interested in the effect of a disciplinary measure upon a particular girl and her 
parents. 


Casimir’s concern with confining, structuring, and checking power can be expressed 
in an age-old principle that Casimir used unobtrusively at one point in his essay — 
that members of a class of persons who are similarly situated (e.g., high school 
students) should be similarly treated. 


In D’Oyley’s “Schooling and Ethnic Rights,” the author develops a strong argument 
for the rights of ethnic groups in Canada, and especially for multilingual programs 
which would help ethnic groups to preserve their backgrounds. For D’Oyley, 
multilingualism needs to be encouraged in order that the immigrant child’s claim to 
self-realization in his own culture becomes a right. 


In summary, it is accurate to observe that part of Children’s Rights will have 
limited usefulness for teachers and students in the whole of Canada because of the 
emphasis upon Ontario and the United States. However, about one-half of the book 
contains articles which provide excellent material for stimulating discussion in classes, 
seminars, adult education groups, and the more senior grades of high schools. In fact, 
Eastabrook’s article, “School Change and the Implication for Students’ Rights,” offers 
two excellent case studies of unsuccessful attempts to involve students in school 
policy-making. 

One final word. A reader who is seeking a concise and precise definition of a 
“right” is bound to be disappointed. Children’s Rights was obviously not organized to 
produce a single definition or a single list of what might be subsumed under a single 
definition. Instead, one has to be content with the exploration of various kinds of 
rights — civil, ethnic, intellectual, physical, and participatory. 


K. D. Hodgson 
Department of Educational Administration 
The University of Alberta 
Alberta Journal of Educational Research Vol. XXVI, No. 3, September, 1980 


J.S. NEUFELD 
University of Regina 
and 
ELAINE COZAC 
Regina Public Board of Education 


A Study of the Self-Concept 
of Intellectually Superior Children 


This study was designed to compare the self-concept of intellectually superior students
with the self-concept of intellectually average students. The nature of the investigation also
permitted a study of the relationships that existed among self-concept, intelligence test 
performance, reading comprehension, mathematics achievement, and overall composite 
achievement. The sample consisted of nineteen intellectually superior and nineteen 
intellectually average grade nine students. The TIPS, WISC-R, and CTBS were the 
instruments used to gather the data. Analysis of data indicated that no significant 
relationship existed between the self-concept and the four variables examined. 


Various factors determine the behaviour of children. A child’s self-concept is one 
key factor. Self theory holds that the self-concept is the frame of reference 
through which a child reacts with his world and as a result has a powerful 
influence on his behaviour. After fifteen years of research on self-concept, Fitts 
and his associates (Fitts, 1972; Fitts et al., 1971) were convinced that a person’s 
self-concept is the key determinant of his behaviour. The self-concept is one of 


Dr. Neufeld (Ph.D., University of Alberta) is Associate Professor and Head of Educational Psychology Subject 
Area at the University of Regina. His research interests include the self-concept of children, the gifted, adolescent 
values, and characteristics of effective teachers. His clinical interests are in human relations, group processes, and 
marriage enrichment. 

Ms. Cozac (M.Ed., University of Regina) is an elementary school teacher with the Regina Public Board of 
Education. Her special interests are counselling at the elementary level and research on self-concept of children and
the gifted children.
the most important aspects of an individual’s phenomenal world and it tends to 
be relatively stable. 

In recent years there has been a growing concern among educators about the 
forgotten minority in our schools, the intellectually gifted children. There has 
been some controversy whether this group of special children develop different 
self-concepts than do average children. In fact, some psychologists have raised 
the question whether indeed these gifted children perform significantly better 
than average children. 

The importance of the intellectually superior child’s self-concept as it relates 
to his total development has been theorized by many educational theorists. Barby 
(1965) found that the problems of personal adjustment increased with children 
whose intelligence reached scores of over 150. Meeker (1969) stated that the 
gifted are more likely to feel unwanted, unaccepted, and frustrated than are 
average children, due to their sensitivity to their environment. Educators such as 
O’Shea (1965) and Treffinger, Borgers, Reader and Hoffman (1976) emphasize 
that of all the personality variables affecting the overall development and 
eventual performance in the life of the gifted children, self-concept is most 
important. 

Studies on the relationship between self-concept and intelligence have 
produced some contradictory findings. Sears (1963) found that children with 
superior mental ability had more positive self-concepts, more positive attitudes 
toward school, and were more creative than children with average ability. 
However, Nemeroff (1964), Trowbridge (1972), and Busby, Fillmer and Smittle 
(1974) reported low correlations between self-concept and intelligence. 


Studies on the relationship between intelligence and achievement of 
intellectually superior children tend to be rather conclusive. That is, Terman 
(1965), Lovell and Shields (1967), and Flescher (1963) all found high correlations 
between intelligence and achievement of intellectually superior children. Seay 
(1961) found positive relationships between total self-concept levels and level of 
vocabulary, comprehension, and reading skills. Lumpkin (1959), comparing a 
sample of over- and under-achievers, found over-achievers possessed a more 
positive self-concept while under-achievers possessed a more negative perception 
of self. Coopersmith (1959) found low correlations between Iowa Achievement 
Test scores and self-esteem of fifth and sixth grade children. Williams (1973), 
however, found no significant correlations between self-concept and reading 
achievement of second grade students. Peper and Chansky (1970), in their study
of fifth grade students, found there was a positive correlation between self-esteem
rankings by self and mathematics achievement as measured by the Iowa Tests of
Basic Skills.


The Problem 


The study was basically designed to compare the self-concept of intellectually 
superior grade seven students with the self-concept of intellectually average grade 
seven students. Furthermore, investigators compared the intelligence test 
performance and academic achievement of intellectually superior grade seven 
students and intellectually average grade seven students. A study of the 
correlations between self-concept, intelligence, and achievement was also 
undertaken. 


Hypotheses 
The following hypotheses were tested: 


1. There is no significant difference between the mean self-concept score of 
intellectually superior and intellectually average students. 


2. There is no significant difference between the mean reading comprehension 
grade equivalent score of intellectually superior and intellectually average 
students. 


3. There is no significant difference between the mean mathematics grade 
equivalent score of intellectually superior and intellectually average students. 


4. There is no significant difference between the mean composite grade 
equivalent score of intellectually superior and intellectually average students. 


Definition of Terms 

Self-concept: There are a variety of terms used to define the construct known 
as self-concept. Terms such as self-concept (Coller, 1971; Perkins, 1958; Rogers, 
1961), self-image (Merrill, 1975), and self-perception (McCandles, 1966) are 
representative of the many used by researchers. For the purposes of this study, 
self-concept is defined as the dynamic organization of ideas and beliefs that an 
individual holds about himself. This definition has been used as a basis for the 
development of the Teachers’ Inventory of a Pupil’s Self-Concept (TIPS). 

Intellectual ability: Intellectual ability is defined in terms of the student’s 
performance on the Otis Beta and WISC-R intelligence tests. 

Intellectually gifted: Intellectually gifted subjects were those subjects whose 
intelligence quotient score was 130 (Full-Scale Score) or over as measured by the 
WISC-R. 

Intellectually average: Intellectually average were those subjects whose 
intelligence quotient score was between 90 and 109 (Full-Scale Score) as 
measured by the WISC-R. 


Instruments 

1. Teachers’ Inventory of a Pupil’s Self-Concept (TIPS). The TIPS is a 
self-report self-concept inventory developed by teachers in the Seven Oaks School 
District in Winnipeg, Manitoba, Canada, in 1975. Faculty members from the 
College of Education within the University of Winnipeg assisted the teachers in 
the development of the inventory. The Winnipeg teachers’ objective in developing 
the inventory was to devise an effective way of assessing a Division II pupil’s 
self-concept so that appropriate teaching strategies could be developed to help 
those students in need of viewing themselves more positively. 

The format of the TIPS reflects the test developers’ desire to overcome the 
unattractiveness of most self-concept inventories. The inventory comes in booklet 
form with each page pastel-coloured and illustrated with cartoon animals (e.g., 
cat, dog, owl, etc.). The booklet contains 13 pages with five statements per page 
to make a total of 65 statements. Statements to which children respond deal with 
their feelings about themselves in a variety of situations relating to work and 
play at home, school, and in the community. 

The student is required to respond to the 65 declarative statements by 
circling either “yes” or “no” for each statement. To avoid having students make a 
set response, the inventory has an approximately equal number of positive and 
negative statements. Hence, both yes and no are appropriate responses to make. 

The inventory is scored by tallying the number of statements answered in a 
way that depicts a positive self-concept. The total possible score for the 
inventory is 65. The higher the student’s score, the more positive his self-concept. 
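
The scoring rule described above can be sketched in a few lines of Python. This is only an illustration: the six-item answer key and the responses below are hypothetical and are not taken from the TIPS booklet, which has 65 statements.

```python
def score_tips(responses, positive_key):
    """Count the responses that match the positively keyed answer.

    responses    -- the pupil's "yes"/"no" answers, one per statement
    positive_key -- the answer that depicts a positive self-concept
                    for each statement (hypothetical key)
    """
    if len(responses) != len(positive_key):
        raise ValueError("one response is required per statement")
    return sum(1 for given, keyed in zip(responses, positive_key)
               if given == keyed)


# Hypothetical six-item excerpt; positive and negative statements are
# mixed, so both "yes" and "no" can be the positively keyed answer.
positive_key = ["yes", "no", "yes", "no", "yes", "no"]
responses = ["yes", "no", "no", "no", "yes", "yes"]
score = score_tips(responses, positive_key)
```

With a full 65-item key the same function would return a score between 0 and 65.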


The TIPS can be administered in a group situation. The inventory has no 
time limit for completion but the writer found that most grade four to seven 
students could complete it within a fifteen-minute time period. 


Test-retest reliability coefficients for the TIPS ranged from .74 to .88: .74 
for grade seven, .76 for grade five, .86 for grade four, and .88 for grade six 
students. 


2. Wechsler Intelligence Scale for Children — Revised (WISC-R), 1974. The 
test was developed by David Wechsler and copyrighted by the Psychological 
Corporation in 1974. It is a revision and restandardization of the 1949 WISC. 
The WISC-R was designed and organized as a test of general intelligence. The 
test reflects Wechsler’s definition of intelligence as the overall capacity of an 
individual to understand and cope with the world around him. The age range of 
the WISC-R is from six years zero months through to sixteen years eleven 
months. 


The WISC-R is an individual intelligence test that consists of six verbal and 
six performance tests which yield a Verbal Scale I.Q., a Performance Scale I.Q., 
and a Full-Scale I.Q. The Verbal, Performance, and Full-Scale I.Q.’s have high 
split-half reliabilities across the entire age range, the average coefficients being 
.94, .90, and .96. The split-half (odd versus even items) reliabilities for the 
individual tests have average coefficients ranging from .77 to .86 for the Verbal 
tests, and from .70 to .85 for the Performance tests. 

The test-retest corrected reliability coefficients range from .63 to .94 across 
three age levels (6.5 to 7.5 years, 10.5 to 11.5 years, and 14.5 to 15.5 years). 

3. Canadian Tests of Basic Skills (CTBS), Form 3, Levels 9-14, 1974. The 
Canadian Tests of Basic Skills are based upon the Iowa Tests of Basic Skills. 
The CTBS are designed to assess levels of skill development in reading, 
vocabulary, the mechanics of writing language, methods of study, and 
mathematics. 

The format of the CTBS consists of one test booklet of 96 pages for grades 
three through eight. The booklet is divided into various levels and each grade 
starts each of the 11 tests at the level appropriate for the grade placement. The 
questions in the booklet are multiple-choice type. 


The students are asked to respond to the multiple-choice questions by filling 
in the circles on the separate answer sheet that correspond to their answer 
choices for the various questions. 


The tests are scored by totalling the number of correct responses. Raw scores 
are converted into grade equivalents. To obtain the student’s performance in a 
skill area the grade equivalents for the subtests in the skill area are averaged. 
For example, to obtain the student’s skill level attainment in mathematics, all 
the mathematics subtest raw scores would be converted to grade equivalents then 
averaged. To obtain the composite score for the entire test battery, the five grade 
equivalents for the five skill areas assessed are averaged. 
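
As a rough illustration of the averaging just described, the sketch below assumes the raw scores have already been converted to grade equivalents. The conversion table is not reproduced here, and all numbers and skill-area names are hypothetical.

```python
from statistics import mean


def skill_grade_equivalent(subtest_grade_equivalents):
    """Average the grade equivalents of the subtests in one skill area."""
    return mean(subtest_grade_equivalents)


def composite_grade_equivalent(skill_area_scores):
    """Average the five skill-area grade equivalents into a composite."""
    if len(skill_area_scores) != 5:
        raise ValueError("the composite is based on five skill areas")
    return mean(skill_area_scores.values())


# Hypothetical grade equivalents for two mathematics subtests.
mathematics = skill_grade_equivalent([7.0, 7.4])

# Hypothetical skill-area grade equivalents for one student.
skill_areas = {
    "vocabulary": 6.8,
    "reading": 7.0,
    "language": 7.1,
    "work_study": 6.9,
    "mathematics": mathematics,
}
composite = composite_grade_equivalent(skill_areas)
```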


The CTBS were standardized in 1973 using a group of over 30,000 school 
children drawn from a stratified sample of some 255 schools from the English 
speaking sector in all provinces of Canada. Hence, both national and local norms 
are available for the CTBS. 

Buros (1959, 1972) reports that the split-half reliability coefficients for both 
the Iowa Tests of Basic Skills and the CTBS are very high. The CTBS test 
manual (1974) states that for grade six (level twelve) the split-half coefficients 
using the Pearson product-moment correlation coefficient range from .81 to .93 
for the various subtests. The composite test split-half reliability coefficient using 
the Spearman-Brown formula is .98 at the grade six level. 
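
For readers unfamiliar with the Spearman-Brown step-up, a minimal sketch follows. It estimates full-length reliability from the correlation between two half-tests; the half-test correlation used in the example is hypothetical and is not a figure from the CTBS manual.

```python
def spearman_brown(half_test_r):
    """Estimate full-test reliability from a split-half correlation.

    Uses the standard step-up for doubling test length:
    r_full = 2 * r_half / (1 + r_half)
    """
    if not -1.0 < half_test_r <= 1.0:
        raise ValueError("correlation must be greater than -1 and at most 1")
    return 2.0 * half_test_r / (1.0 + half_test_r)


# Hypothetical correlation between the odd-item and even-item halves.
full_test_reliability = spearman_brown(0.96)
```

The corrected coefficient is always at least as large as a positive half-test correlation, because each half is only half as long as the full test.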

The entire test battery requires five hours to complete. The manual 
recommends that testing be done over a four-day period. The reading 
comprehension test and mathematics tests require an hour each. 


The Sample 
The population for the study consisted of students that fulfilled the following 
three requirements: 


1. The students were enrolled in regular grade seven classrooms in the Regina 
Public School system for the 1976-1977 school year. 


2. The students had written the Otis Beta Test in the school year 1973-1974, 
and obtained I.Q.’s in either the superior range (I.Q. 130 plus) or average 
range (I.Q. 99-109). 


3. The students had written the Canadian Tests of Basic Skills during May, 
1976. 


The comparative part of the study involved a sample of 19 intellectually 
superior students and 19 intellectually average students chosen on a random 
basis from the population described above (see Table 1). To obtain the sample 
the writer employed a table of random numbers. The randomly chosen students 
were then individually tested using the following measuring instruments: (a) 
Wechsler Intelligence Scale for Children-Revised (WISC-R) and (b) Teachers’ 
Inventory of a Pupil’s Self-Concept (TIPS). 


TABLE 1 


WISC-R TEST RESULTS FOR SUPERIOR 
AND AVERAGE STUDENTS 


                                                                  Two-Tailed 
Variable          Group      Mean     S.D.          T Value       Probability 
WISC-R            Superior   137.63   8.49          13.66*        .000 
VERBAL            Average     98.47   [illegible] 
WISC-R            Superior   130.11   8.00          [illegible]   .000 
PERFORMANCE       Average    105.68   [illegible] 
WISC-R            Superior   137.89   5.87          19.44*        .000 
FULL-SCALE I.Q.   Average    101.63   5.93 


*Denotes significance at the .05 level of significance. 


Data Analysis 


Raw data were collected in the Regina schools and statistically analyzed using 
the computer program known as the Statistical Package for Social Sciences 
(SPSS) Xerox Version 6.02. For purposes of comparing the mean intelligence 
and self-concept scores, the t-test method of analysis was used. Correlations 
between intelligence and achievement were examined using the Pearson 
product-moment correlation formula. All hypotheses were tested at the .05 
level of significance. 
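
The two statistics named above can be sketched in plain Python. The authors used SPSS; the code below is only a stand-in. It assumes the pooled-variance form of the t test, which the text does not specify, and the small data sets are hypothetical rather than the study's data.

```python
import math
from statistics import mean, variance


def independent_t(group_a, group_b):
    """Pooled-variance t statistic and degrees of freedom for two groups."""
    n_a, n_b = len(group_a), len(group_b)
    df = n_a + n_b - 2
    pooled = ((n_a - 1) * variance(group_a)
              + (n_b - 1) * variance(group_b)) / df
    standard_error = math.sqrt(pooled * (1.0 / n_a + 1.0 / n_b))
    return (mean(group_a) - mean(group_b)) / standard_error, df


def pearson_r(x, y):
    """Pearson product-moment correlation between paired scores."""
    mean_x, mean_y = mean(x), mean(y)
    sxy = sum((xi - mean_x) * (yi - mean_y) for xi, yi in zip(x, y))
    sxx = sum((xi - mean_x) ** 2 for xi in x)
    syy = sum((yi - mean_y) ** 2 for yi in y)
    return sxy / math.sqrt(sxx * syy)


# Hypothetical self-concept scores for two small groups.
superior = [52, 48, 55, 50, 45]
average = [47, 44, 50, 46, 43]
t_value, df = independent_t(superior, average)

# Hypothetical paired I.Q. and grade-equivalent scores (perfectly linear).
iq = [100, 110, 120, 130, 140]
grade_equivalent = [6.5, 7.0, 7.5, 8.0, 8.5]
r = pearson_r(iq, grade_equivalent)
```

The t value would then be compared with the two-tailed critical value for the given degrees of freedom at the .05 level.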


Hypothesis 1 


The mean self-concept score obtained on the TIPS by the intellectually superior 
grade seven students was 50.53 compared to 46.90 for the intellectually average 
grade seven students. When the t test was applied, the difference in mean 
self-concept scores was not significant. Therefore, the hypothesis was upheld. 
See Table 2 for results. 


TABLE 2 


STATISTICAL DATA COMPARING THE TEST RESULTS 
OF INTELLECTUALLY SUPERIOR GRADE SEVEN STUDENTS 
WITH INTELLECTUALLY AVERAGE GRADE SEVEN STUDENTS 


                  Group                                           Two-tailed 
Variable          (N=19)     Mean    S.D.          T Value       Probability 
TIPS              Superior   50.53   8.20          1.60          .120 
                  Average    46.90   9.00 
CTBS READING      Superior    8.96    .63          [illegible]   .000 
COMPREHENSION     Average     6.95    .99 
CTBS MATH         Superior    9.00    .54          6.63*         .000 
COMPREHENSION     Average     7.20   1.13 
CTBS COMPOSITE    Superior    9.02    .49          [illegible]   .000 
ACHIEVEMENT       Average     7.07   [illegible] 


*Denotes differences in means between the two groups are significant at the .05 level of 
significance. 


Hypothesis 2 

The mean reading comprehension grade equivalent score of the intellectually 
superior grade seven students was 8.96 while the mean reading comprehension grade 
equivalent score for the intellectually average grade seven students was 6.95. 
The difference was significant. Therefore, the hypothesis was rejected. See Table 2 for results. 


Hypothesis 3 

The mean mathematics grade equivalent score of intellectually superior grade 
seven students was 9.00 while the mean mathematics grade equivalent score of 
intellectually average students was 7.2. Therefore, the hypothesis was rejected. 
See Table 2 for results. 


Hypothesis 4 

The intellectually superior grade seven students achieved at the mean 
composite grade equivalent of 9.02 while the intellectually average grade seven 
students achieved at the mean composite grade equivalent of 7.07. The 
hypothesis was also rejected. See Table 2 for results. 

Upon analysis of the correlations of self-concept with the variables of sex, 
intelligence and achievement, no significant correlations were found. 

Correlation between the Otis Beta intelligence scores and the WISC-R scores 
and the WISC-R Performance Scale scores was .76, while the correlation between 
the Otis Beta intelligence scores and the WISC-R Full-Scale scores was .87. 
WISC-R Full-Scale intelligence scores correlated .88 with the CTBS composite 
grade equivalents. See Table 3 for results. 


TABLE 3 


CORRELATION MATRIX FOR SUBJECTS IN THE STUDY 


                                  1           2           3           4           5           6           7           8           9 
1. Sex 
2. Beta I.Q.                      -.05 
3. WISC-R Verbal I.Q.             [illegible] 
4. WISC-R Performance I.Q.        [illegible] 
5. WISC-R Full-Scale I.Q.         [illegible] 
6. CTBS reading comprehension     [illegible] 
7. CTBS mathematics achievement   -.06        .71**       .78**       .67**       .79**       .79** 
8. CTBS composite                 [illegible] 
9. TIPS                           [illegible] [illegible] .20         .10         .18         .18         .16         .16 


*Denotes significance at the .05 level. 
**Denotes significance at the .001 level. 


Results of this study demonstrate that while the self-concept scores of 
intellectually superior grade seven students were somewhat higher than the 
self-concept scores of intellectually average grade seven students, the differences 
were not significant. 

Results, however, did indicate that the intellectually superior grade seven 
children achieved at a significantly higher grade level in reading comprehension 
and mathematics. This same group of intellectually superior children achieved at 
a significantly higher overall composite grade level when considering results on all 
the CTBS tests. 

Finally, the self-concept scores of the intellectually superior do not correlate 
significantly with intelligence and achievement. 


Limitations 


The study was limited by the nature of the instruments used. However, 
validation and reliability studies have been conducted on each instrument, and 
the instruments have been repeatedly used by researchers for a number of years. 

A second limitation may be that of size of the sample. Factors such as 
attrition, incomplete records, and the researchers’ objective to have the identical 
number of subjects in both groups resulted in a small sample. 


Conclusions 


Generalizability of the results in this study to other populations of 
intellectually superior children is possible only to the extent to which the 
students in the study represent the larger population of intellectually superior 
children: 


1. Intellectually superior children did not have significantly higher (more 
positive) self-concept than intellectually average children. These results were 
contrary to Sears’ (1965) findings; however, they tended to support the findings of 
Simon and Simon (1975) and Trowbridge (1972). 


2. The relationship between self-concept and achievement was not significant. 
This result concurred with the findings of Bishton (1957), Flescher (1963), 
Lichtman (1971), Lovell and Shields (1967), Seagoe (1972), and Terman (1965). 


3. Achievement in mathematics, reading and overall achievement by the 
intellectually superior children was significantly higher than that achieved by the 
intellectually average children. These results support the findings of Bishton 
(1957), Lovell and Shields (1967), Reed (1967), and Seagoe (1972). 


Effects of Social Relationships Curricula 
and Sex Differences on Reading 
and Imaginativeness in Young Children 


In this study, the effects of social relationships curricula and sex differences on reading 
readiness and imaginativeness in five-year-old children were investigated. Several 
preliminary analyses were run on pretest scores. The results of a 2 x 2 (treatments by 
sex) analysis of variance showed no significant main or interaction effects on I.Q. Results of Fmax tests on 
treatment-sex conditions (e.g., experimental-girls; control-girls) showed nonsignificant 
differences between groups on both measures. The main analyses consisted of two 2 x 2 
(treatments by sex) analyses of covariance on reading and imaginativeness measures with 
post-hoc tests. The results indicated that children in experimental outperformed those in 
control treatments, girls outperformed boys on reading and imaginativeness, and 
experimental-girls did better than control-boys on imaginativeness. 


Social relationships, regarded by Mand (1977) as the Fourth Basic ’R, has been 
largely ignored in formal school curricula at all grade levels. Social relationships, 
according to Mand, is the affective or social-cognitive capacity of the individual 
to relate to and be able to perceive the “other.” Mand noted that social 
relationships is relegated to recess, free play, social education classes, or 
randomly spaced teacher-guided instruction depending on pupil’s needs. Then 
too, the feeling that systematic planning is more important for cognitive than 
affective growth has long been an issue in early childhood and general curriculum 
development. Further, when growth of social relationships is seen as the opposite 
of cognitive development, similar to play juxtaposed to work, the importance of 
systematic planning for affective learning further diminishes. The push for the 
Basic Three ’Rs and disagreements over the objectives of schooling have helped 
to reduce the importance of the Fourth ’R in early childhood and curriculum 
programming. 

However, some curriculum developers and planners are reconsidering the 
importance of social relations and other forms of affective curricula. For example, 
Tanner and Tanner (1975) and Ball and Bogatz (1973) on a more general level, 
and Evans (1975), Messick and Barrows (1972), and Palmer, Cazden and Glick 
(1971) for early childhood curricula emphasize planning, implementing and 
evaluating of social relationships curricula and other forms of affective learning 
in school programs. Then too, research results (Curry & Arnaud, 1974; Piaget, 
1972; Singer, 1974) show that “imaging” or the capacity to project mentally the 
feelings and thoughts of others is the intersect between the development of social 
relationships, as one affective component, and cognition. Role play, the capacity 
to change oneself into another person or object, provides the opportunity for 
imaging to occur (Piaget, 1972; Singer, 1974; Smilansky, 1968). 

More specifically, research results of Aspy and Roebuck (1976), Piaget (1972), 
Saltz, Dixon and Johnson (1977), Singer (1974), and Smilansky (1968) show that 
growth of social relationships is linked with development of both affective and 
cognitive processes and is significant to intellectual learning. 


The research results of Aspy and Roebuck (1976) show a high positive 
correlation between students who had high ratings on social awareness and 
empathy and those who scored high on intelligence measures. These same 
students also had teachers who were rated high on personal regard for pupils’ 
classroom performances. Similarly, the results of studies conducted by Saltz, 
Dixon and Johnson (1977) and Smilansky (1968) further: (a) support the 
importance of social relationships; (b) link it with cognitive growth and learning; 
and (c) show the impact of social relationships using role playing on intellectual 
and affective processes in young children. Saltz, Dixon and Johnson (1977) 
reported that children trained in social relationships curricula through role play 
showed greater empathy and feeling for others, demonstrated more individual 
and group impulse control, and made greater gains in cognitive growth than 
those in the control groups. The results of Smilansky’s (1968) research, using role 
play episodes with children from high and low socioeconomic strata (SES), 
indicated that low SES subjects who role played and those who role played and 
received supplemental life experiences significantly changed their social and 
language behaviours toward the high SES (compared to other low SES subjects 
who did not receive role play experiences). The research results (Smilansky, 
1968) of greater social awareness, more interpersonal actions of group members, 
and increased number of original words used in sentences (compared with 
baseline samples) show the importance of role play and its association with social 
relationships and language learning. 

The research results of Aspy and Roebuck (1976), Saltz, Dixon and Johnson 
(1977), and Smilansky (1968) have: (a) documented the relationships of the 
Fourth ’R to intellectual and affective learning; (b) operationalized social 
relationships using its element of role play; and (c) shown the efficacy of 
developing, implementing, and evaluating social relationships curricula through 
role playing relative to intellectual and affective measures in young children. 


The present investigation focused on the effects of: (a) social relationships 
curricula through role playing on measures of reading achievement and 
imaginativeness; and (b) sex differences on measures of reading achievement and 
imaginativeness in young children. 


Previous research studies surveyed have not explored social relationships 
curricula through role playing: (a) on measures of reading achievement and 
imaginativeness in young children; (b) in conjunction with main effect of sex on 
the measures; and (c) as a Basic ’R that can be implemented and taught as a 
viable part of kindergarten curricular programming. 

Accordingly, the null hypotheses of major concern in the study are that there 
will be no differences: (a) between social relationships curricula and control 
treatments on reading readiness and imaginativeness measures; and (b) between 
girls and boys on reading readiness and imaginativeness measures. 


Procedures 


Subjects 

The subjects were 96 children five years of age. Twenty-four children, 12 girls 
and 12 boys, in each of four intact classrooms in four elementary schools in two 
adjoining suburban school districts were employed in the study. The schools and 
classrooms were chosen because of the cooperativeness of the teachers and 
building principals and because there was an equal number of girls and boys per 
classroom. With the names of the teachers placed on cards and pulled from a 
container (without replacement), two classrooms were randomly assigned as 
experimental and the other two as control treatment groups. All children were 
pulled from a middle class socioeconomic population as represented by the white 
collar occupations of the heads of household: sales and insurance personnel, 
education and nursing fields, secretarial staff, and other representatives of white 
collar professions (Warner, Meeker & Eells, 1960). 
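
The card-drawing procedure described above amounts to random assignment without replacement. A minimal sketch follows; the teacher labels and the seed are hypothetical and serve only to make the illustration repeatable.

```python
import random


def assign_classrooms(teachers, seed=None):
    """Randomly split intact classrooms into two equal treatment groups.

    Mirrors drawing name cards from a container without replacement:
    the first half drawn becomes experimental, the remainder control.
    """
    rng = random.Random(seed)
    drawn = rng.sample(teachers, k=len(teachers))  # no card is drawn twice
    half = len(drawn) // 2
    return {"experimental": drawn[:half], "control": drawn[half:]}


# Hypothetical labels for the four classroom teachers.
teachers = ["Teacher A", "Teacher B", "Teacher C", "Teacher D"]
groups = assign_classrooms(teachers, seed=1)
```

Because whole classrooms rather than individual children are assigned, the unit of randomization is the classroom.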


Procedure 

Assessment. The pre- and post-assessment phases were administered in 
September and again in May of the school year. In the pre- and post-assessment 
phase, the children were administered the Gates MacGinitie Reading Readiness 
Skills Test (Gates & MacGinitie, 1968) and the Singer Imaginativeness Inventory 
(Singer, 1974). The Gates MacGinitie is a group test that measures reading 
achievement across eight skill areas, such as listening and auditory 
comprehension, letter and word recognition. The child’s responses are scored 
correct or incorrect. The Singer is an individually administered test, containing 
10 questions, and assesses the level of imaginative predispositions in young 
children. An example of a stimulus item from the Singer is, “Do you have a 
make-believe friend? What are some of the things you do with your make-believe 
friend?” The child’s answers per stimulus item are then scored on a 1 (i.e., low 
imaginativeness) to 5 (i.e., high imaginativeness) scale. In the pretest (but not in 
the posttest) phase, the subjects were administered the Peabody Picture 
Vocabulary Test: Form B to assess possible preliminary I.Q. differences among 
the four treatment groups. The treatments began after termination of the pretest 
phase. 

Treatment. The duration of the experimental and control treatments was 15 
minutes per day—five days per week across seven consecutive months. The 
treatments were administered immediately following the kindergarten 
mathematics readiness program rather than reading readiness to reduce possible 
“carry-over” effects of reading content into experimental and control conditions. 
To counterbalance for the time of treatment administration, one experimental 
treatment occurred in the morning and the other in the afternoon session, and in 
similar fashion the control treatments were divided between morning and 
afternoon, given full-day kindergarten programs. 


In the experimental treatment, the two teachers implemented social relations 
curricula through role taking. Basically, they were trained to employ eight basic 
strategies for developing and using role play with young children. The strategies, 
described in detail elsewhere by Yawkey (in press), focused on: (a) setting up role 
play experiences in individual and small groups (of no more than five 
participants); (b) using shared communication between peers and between the 
adult and child for role taking experiences; (c) employing incidental learning 
experiences as role activities; and (d) utilizing children’s stories read to pupils as 
grist for role play activities. From the perspectives of social relationships, the 
experimental treatment emphasized feelings, concerns and interests of the 
students, perceptions of the other, cooperation, helping, and individual and social 
group problem-solving through role enactment. Essentially, the students in the 
experimental group were required to view themselves in relation to the other. 


In the control treatment, the two teachers used ordinary paste and cut and 
art and craft activities. The activities in the control group were completed on an 
individual basis by children working at their desks. The type and form of the 
experiences helped minimize child-child interaction while at the same time 
emphasizing work products and their completion. In addition, the activities in 
the control treatment were motor based and required the children to use small 
and large muscles and motoric movements. The paste and cut and art and craft 
experiences were largely based on holiday themes and special events that 
occurred in the elementary school during the weeks and months. A secondary 
source for the activities was art and craft books from which various independent 
seat-work activities were selected by the teacher. 


Analysis 

A number of analyses were initially conducted on the pretest measures. For 
the initial pretest analysis, the results of the variables of treatment groups (i.e., 
experimental versus control) and sex (i.e., boys versus girls) on Peabody Picture 
Vocabulary Test (P.P.V.T.) scores, using a two-way analysis of variance to test 
null hypotheses of no differences on P.P.V.T.-I.Q. between and within conditions, 
showed that neither main nor interaction effects were significant (i.e., for 
treatments, F = 1.22, df = 1/92, p > .05 and for sex, F = 1.95, df = 1/92, p > 
.05). 

For the Gates MacGinitie, the raw scores ranged from 0 to 142 and for the 
Singer, 10 to 50 (with inter-rater agreement of .93 for the pretest and .98 for the 
posttest). Two other analyses, prior to treatment, were run on the Gates 
MacGinitie and Singer pretest scores. Hartley’s Fmax test, as described by Winer 
(1962), was used across four treatment-sex conditions (i.e., experimental-girls, 
control-girls, experimental-boys, and control-boys) on pretest Gates MacGinitie 
and for pretest Singer scores. The Fmax test for homogeneity of variance was used 
to determine whether the subjects in the treatment-sex conditions differed from 
one another on Gates MacGinitie and Singer scores prior to treatments. The 
results of the Fmax test showed that the subjects did not differ significantly across 
treatment-sex conditions on the Gates MacGinitie and Singer measures. 
Preliminary analyses showed that across the four classrooms, the subjects in the 
treatments by sex conditions on P.P.V.T. and treatment-sex conditions on Gates 
MacGinitie and Singer measures were homogeneous relative to the tests and 
treatment groups employed. 
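
Hartley's Fmax statistic is the ratio of the largest to the smallest group variance. The sketch below assumes equal group sizes, as the test requires, and uses hypothetical pretest scores for the four treatment-sex conditions rather than the study's data.

```python
from statistics import variance


def hartley_f_max(groups):
    """Return the largest sample variance divided by the smallest.

    groups -- mapping of condition name to a list of scores; all
              conditions are assumed to have the same number of scores
    """
    sizes = {len(scores) for scores in groups.values()}
    if len(sizes) != 1:
        raise ValueError("Hartley's test assumes equal group sizes")
    variances = [variance(scores) for scores in groups.values()]
    return max(variances) / min(variances)


# Hypothetical pretest scores for the four treatment-sex conditions.
pretest = {
    "experimental-girls": [10, 12, 14, 16],
    "control-girls": [11, 12, 13, 14],
    "experimental-boys": [9, 12, 15, 18],
    "control-boys": [10, 11, 13, 14],
}
f_max = hartley_f_max(pretest)
```

The statistic is then compared with a tabled critical value for the number of groups and the degrees of freedom per group; a value below the critical value is consistent with homogeneous variances.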

For pre-posttest analyses, the variables of (a) experimental and control 
treatments and (b) girls and boys were crossed orthogonally implying a 2 x 2 
factorial analysis of covariance as described by Winer (1962). Two separate 
analyses, one on the Gates MacGinitie and the other on Singer were run with 
pretest scores serving as the covariate variable and the posttest scores the 
criterion variable for each set of measures, respectively. 

On the Gates MacGinitie measure, the results of the analysis showed 
significant main but no significant interaction effects. On the Singer measure, the 
results of the analysis evidenced significant main and interaction effects. The 
summary tables for the analyses of covariance on the Gates MacGinitie and 
Singer measures are shown in Tables 1 and 2, respectively. The pre- and 
post-group means by treatment on the Gates MacGinitie and Singer measures are 
illustrated in Table 3. 


TABLE 1 


SUMMARY FOR ANALYSIS OF COVARIANCE FOR MAIN AND 
INTERACTION EFFECTS ON GATES MacGINITIE SCORES FOR 
TREATMENT AND SEX 


Source             df    SS           MS            F 
Treatment           1    160892.52    160892.52     04.71* 
Sex                 1    431491.68    431491.68     127.14* 
Treatment x Sex     1      4921.02      4921.02       1.45 
Residual           92     33941.58   1493429.70 

*p < .05 


TABLE 2 


SUMMARY FOR ANALYSIS OF COVARIANCE FOR MAIN AND 
INTERACTION EFFECTS ON SINGER SCORES FOR TREATMENT AND SEX 


Source             df    SS        MS            F 
Treatment           1     30.08     30.08         8.48* 
Sex                 1    215.33    [illegible]   [illegible] 
Treatment x Sex     1     56.33     56.00        15.87* 
Residual           92    156.17      3.00 

*p < .05 


Using a Scheffé post-hoc statistic on significant main effects of treatments 
and sex on the Gates MacGinitie scores the results indicated: (a) social 
relationships treatment group yielded significantly higher mean scores than the 
control and (b) girls yielded significantly higher mean scores than the boys. The 
Scheffé post-hoc statistic on the results of the Singer analysis showed: (a) social 
relationships treatment group yielded significantly higher mean scores than the 
control; (b) girls yielded significantly higher mean scores than the boys; and (c) 
the girls in the social relationships group yielded significantly higher mean scores 
than the boys in the control condition. 


TABLE 3 


PRE- AND POSTTEST GROUP MEANS BY TREATMENT 
ON GATES MacGINITIE AND SINGER MEASURES 


                     Pretest Group Means        Posttest Group Means 
                     by Treatment               by Treatment 
                     Experimental   Control     Experimental   Control 
Measures             X̄              X̄           X̄              X̄ 
Gates MacGinitie     55.02          49.91       67.10          55.01 
Singer                8.91           8.63       13.90          11.71 


Discussion 


The investigation explored the effects of social relationships curricula 
compared to control treatments on measures of reading readiness and 
imaginativeness. The major results of the study were: (a) on reading readiness 
measures, the social relationships curricula group yielded significantly higher mean 
scores than the control group and girls significantly outperformed boys; (b) 
on the measures of imaginativeness, the social relationships curricula group yielded 
significantly higher mean scores than the control group and girls 
outperformed boys (across the groups); and (c) on imaginativeness, the girls in the 
social relationships group performed significantly better than the boys in the control 
condition. 


The results of the study, supporting previous research (Aspy & Roebuck, 
1976; Saltz, Dixon & Johnson, 1977; Smilansky, 1968), indicate that social 
relationships was a facilitator of cognitive learning of reading readiness concepts. 
Aspy and Roebuck (1976), using I.Q. measures, and Piaget (1972) with cognitive 
tasks, have both recognized the association between social relationships and 
cognitive tasks. Saltz, Dixon and Johnson (1977) note that social relationships 
through role play appeared to facilitate the identification process of the young 
child with the story characters and situations and increased motivational 
properties relative to intelligence and impulse control. Further, role play, 
permitting the children to feel, act, and think like the characters in the story 
passages, helped them to relate to the situations in the episode and facilitated 
their understandings of story content and concepts (Saltz, Dixon & Johnson, 
1977). Smilansky’s results (1968), in addition, indicate that imaginative play 
increased children’s language learning (relative to baseline measures) on: (a) the 
number of sentences and non-repeated words used in them and (b) the frequency 
of using contextual clues to derive word, sentence, and paragraph meaning. 

On group reading achievement measures, the fact that differences between 
boys and girls appeared after seven months of formal reading readiness 
instruction was consistent with the results of other investigations (Duncan, 1976; 
Klein, 1977; Stanchfield, 1970). Cultural, biological, and psycholinguistic theories 
have attempted to explain observed differences favouring girls over boys in early 
formal reading instruction. The effects of treatments in the present study 
apparently “washed out” given large observed differences favouring girls in 
reading. 

The results on the Singer showed that the imaginative predispositions of 
young children significantly increased through social relationships training. 
Although the same amounts of classroom time were given to treatment 
conditions, children in the social relationships curricula using role play received 
significantly higher scores on imaginativeness than those in the control treatment. 
According to Piaget (1972), role play as a cognitive imaging process requires 
imaginativeness in which the child (a) suspends reality, in differing degrees, to 
achieve personal and/or social group goals and (b) uses make-believe in greater or 
lesser degrees to reconstruct reality. Accordingly, role play could have provided 
an escape from reality and assisted flexibility and originality of thought. 
Feitelson and Ross (1973) and Saltz, Dixon and Johnson (1977) have reported 
similar effects for role play on imaginativeness and originality of thought. 


On the Singer inventory, girls did significantly better than boys on 
imaginativeness across groups. Further qualifying the results was a significant 
interaction effect which showed that girls in social relationships significantly 
outperformed boys in the control group. The results of Garvey (1977), Jones 
(1972), McCune and Mathews (1976), and Millar (1976) also showed performance 
differences between girls and boys in imaginativeness and role play preferences, 
quality of make-believe, types of play, and quality and quantity of language 
generated during play. Specifically, Garvey’s results indicate that verbal abilities 
in role play and general language performance in formal academic settings were 
greater for young girls than for young boys. Conceivably, the task required in 
the social relationships treatment group could have been more conducive and 
facilitative for girls than for boys, considering its emphasis on verbal language 
(compared to the control condition). Also, the results of Jones and of McCune 
and Mathews showed that young girls preferred play forms with verbal rather 
than physical elements. 


There were a number of distinct limitations to the study: (a) student 
populations and their SES levels used in the research; (b) measures employed in 
assessing behaviours; and (c) the procedures employed in the treatment groups. 


Further research might find solutions to the following extended questions: 


1. What differential effects, if any, would child-generated versus adult-guided 
social relationships curricula have on reading readiness curricula and 
imaginativeness of young children? 


2. What differential effects, if any, would verbal, verbal-motor, and motor 
forms of role play as rehearsal conditions have on reading readiness and 
imaginativeness of girls compared to boys? 


3. What differential effects, if any, would fantasy versus reality based role play 
as rehearsal conditions have on reading readiness and imaginativeness of 
young children? 


The author expresses appreciation to Research and Program Development Division, The Economy Company 
Educational Publishers, Oklahoma City, Oklahoma for providing funds for the research investigation. 


Portions of the paper were presented at the Annual Meeting of the American Educational Research Association, 
San Francisco, 1979. 


Children’s Knowledge About Skimming 


Grades 4, 6, and 8 children were interviewed to investigate their awareness of skimming 
techniques. Their strategic search behaviour for specific pieces of information in 
passages was also observed. Many children at all grade levels were able to think about 
(a) what the opening sentence generally does for a paragraph, and (b) how the 
sought-for information may be expressed in a passage. Using this knowledge (a and b), 
virtually all of the grade 8 but only one-half to two-thirds of the grades 4 and 6 
children were able to describe “how to skim.” Children in all three grade levels could 
skim when explicitly instructed to do so, but spontaneous skimming as a strategy based 
on implicit task requirements was most clearly observed at the grade 8 level. It was 
concluded that the development of knowledge about skimming is an increasing 
integration of various pieces of knowledge concerning learning materials, means-end 
relationships, and implications of specific task demands. 


Children’s ability to think about their own cognitive processes (generally 
known as metacognition) has become an important topic in the investigation of 
cognitive development (e.g., Brown, 1978; Flavell, 1978; Flavell & Wellman, 1977). 
Many of the existing studies have centered on children’s knowledge about their 
memory abilities and the strategies they use for the learning of simple memory 
tasks, e.g., recall of a list of pictorial items (see review articles by Brown, 1978; 
Flavell & Wellman, 1977). In contrast, relatively little is known concerning 
children’s awareness of many other strategies that are likely to be used in the 
acquisition of meaningful information common to school requirements (Brown, 
1978; Flavell, 1977), although some recent explorations have been made in this 
extended direction (e.g., Brown & Smiley, 1978; Brown, Smiley & Lawton, 1978; 
Danner, 1976; Myers & Paris, 1978). 

In the present study, the development of knowledge about one locational skill 
was investigated, viz., the search for information in a prose passage. The 
importance of such a skill in school learning has been emphasized by a number 
of experts on reading and study strategies (e.g., Spache & Spache, 1977; Thomas 
& Robinson, 1975). This emphasis is understandable considering the fact that 
today new information is discovered and accumulated with increasing rapidity. A 
single classroom textbook can no longer be regarded as a sufficient and 
up-to-date source of information for most content areas (e.g., social studies, 
science). Students’ success in school learning, therefore, is increasingly more 
dependent on their knowledge as to where and how to locate new information 
efficiently (Thomas & Robinson, 1975). In view of this, the present study should 
provide useful data for our understanding of the development of “ecologically 
valid” study strategies in children. 


Efficient search for information depends on various pieces of knowledge. 
Students, for example, need to know the existence of such reference materials as 
the dictionary, encyclopedia, and periodicals, and when and how to use them. 
They need to know how to use parts of a book (e.g., table of contents and 
indexes) as aids in locating relevant sections. They further need to know and use 
the skill of skimming to find specific pieces of information within the relevant 
sections. While some normative data for children’s knowledge about reference 
materials and book parts are available (e.g., King, 1968), comparable data are 
unavailable for the development of skimming skills. For this reason, from among 
various locational skills, the strategy of skimming was investigated. 


Imagine now that children have located a book with sections containing 
information that bears upon their research problem. How do they proceed to find 
pertinent information in these pages? One piece of knowledge that children may 
learn is that, in looking for a single fact, the reading of entire paragraphs is 
unnecessary. Instead, they learn they can glance swiftly down the pages, skipping 
everything but target information. This type of effective search involves at least 
two pieces of knowledge: (a) opening sentences of a paragraph often introduce its 
general content and thereby indicate whether or not the sought-for information is 
likely to appear in it, and (b) sought-for information is likely to appear in the 
form of key words, numerical figures, symbols, etc., and a search can be made 
quickly by skipping all but these predetermined “clues” (Spache & Spache, 1977; 
Thomas & Robinson, 1975). The investigation of these pieces of children’s 
knowledge about when and how to skim provided the major focus of this study. 

In the present study, both behavioural and verbal tasks were used. In 
previous interview studies, children generally showed better metamemorial 
“ability” when they had actual experience in memorizing items (e.g., Moynahan, 
1973). Thus, to assess fully children’s awareness of how to skim, they initially 
were presented with tasks requiring skimming strategies (behavioural tasks). The 
behavioural tasks were also included to provide some data concerning children’s 
knowledge about when to skim. Under the behavioural tasks, children were asked 
to read four comparable short stories under four different instructions. The first 
condition was designed to ensure that the children would read the whole story 
word-by-word and thereby provide a baseline measure of reading speed. The 
second and third conditions were designed to assess children’s spontaneous 
disposition to use knowledge about paragraph structure and key words, 
respectively, for effective skimming. Under each of these two conditions, the 
children were told to find one specific fact in a passage. If children are 
knowledgeable about when to skim, stories would be read faster under the second 
and third conditions than under the first. In addition, even if children have 
appropriate knowledge about skimming strategies, they may not show 
differentiated reading behaviours under the present tasks for at least one reason. 
They simply may not have the required skills to carry out what they know. To 
determine this possibility, the fourth condition was used under which the 
children were given key words and were explicitly instructed how to use these to 
locate a piece of information. 


Following the completion of the third behavioural task described above, a 
series of interview questions (verbal task) were presented to the children to 
assess their verbalizable knowledge about “how to skim” efficiently using the 
paragraph structure and key words. Attempts were made by these questions to 
test the lower limits of children’s understanding of such strategies. 

Children of grades 4, 6, and 8 participated in this study. The grade 4 children 
were used because children at this grade level begin to learn various locational 
skills, including the skill of skimming. Teachers concentrate on such skills 
throughout the subsequent grade levels. Thus, one might expect a developmental 
change to occur in children’s knowledge about skimming within this age range. 


Method 


Subjects 

Subjects were 24 grade 4 (12 boys and 12 girls), 24 grade 6 (12 boys and 12 
girls), and 14 grade 8 (8 boys and 6 girls) children from a parochial school in a 
predominantly lower-middle and middle class suburb of Windsor, Ontario. The 
mean chronological ages were 9-11, 12-1, and 13-11 for grades 4, 6, and 8, 
respectively. According to their teachers’ judgments, none of these children 
should have had difficulty in reading the materials used in this study. 


Materials 

Prose passages. Four sets of factual prose passages, each at the grade 3.8 
readability level (Spache, 1953), were prepared for the behavioural tasks. Two 
sets were about bears and two about cities. The materials were similar to those 
found in a children’s encyclopedia. 

Sets 1 and 2 each contained two paragraphs. In set 1, various characteristics 
of the brown bear were described in the first paragraph while characteristics of 
the polar bear were described in the second. The first paragraph was used to 
obtain the baseline measure and the second paragraph to acquaint the children 
further with the paragraph structure. In set 2, the major features of the black 
bear were described in the first paragraph and those of the grizzly in the second. 
The general content of each paragraph was introduced by the first sentence so 
that children could completely skip a task-irrelevant paragraph (e.g., black bear) 
and proceed to the second one (e.g., grizzly bear) if asked to locate a specific 
answer about the grizzly. The amount of time allocated by the children on these 
first paragraphs constituted the critical behavioural measure. Thus, the length of 
the first paragraphs in each of the two sets was equated, both containing 121 
words. 


Each of the remaining two sets consisted of a single paragraph containing 
various pieces of information about Vancouver (set 3) or New York City (set 4). 
The target information requested, the average temperature of each city during 
the winter, was located at the 121st word, and was expressed in numerical form 
followed by the symbol for temperature degree. No other temperature 
information or numerical figure appeared in either of the paragraphs. All four 
sets were made so that the experimenter could measure the speed with which 
children read the same length of the material (i.e., 121 words) under four 
different conditions. 


Each of the four sets was typewritten on plain white paper (22 x 28 cm) with 
each paragraph printed on a separate page. Each set was presented separately 
with a blank paper as the cover page. 


Interview questions. Two major sets of interview questions were prepared for 
the verbal tasks. One set was prepared to assess children’s knowledge about 
paragraph structure for use in efficient skimming while the other was prepared to 
assess knowledge about the use of key words. These interview questions are given 
verbatim in the Results and Discussion section. 


Procedure 


The children were tested individually in a quiet room. Testing was carried out 
in three phases. In phase 1, behavioural measures (reading speeds) were obtained 
under different task demands to provide evidence for children’s awareness of 
when to and when not to skim. In phase 2, the children’s verbalized knowledge 
about skimming strategies was assessed by the verbal task. Finally, in phase 3, 
the children’s basic skills to perform skimming strategies were examined by a 
final behavioural task. 

Phase 1. The experimenter began this phase by saying: “Today I have some 
stories for you to read and a few questions I would like to ask you.” Having 
ensured that the children knew that there were four kinds of bears in North 
America, the experimenter gave the children set 1 to obtain the baseline measure 
of reading speed (hereafter called whole story condition). The following 
instructions were used: “I want you to read every line so that you have read 
everything that is in the story. The story is written on two separate pages. When 
you have finished with the first page, go right on to the second one.” The speed 
with which the children read the first paragraph was measured by using a 
Hunter 1910 timer to the nearest second. The experimenter manually activated 
the timer when the children turned the blank cover page and then stopped it 
when they turned the page containing the first paragraph. 


The experimenter then proceeded to the second task in which the children 
could skim efficiently by using the opening sentence of a paragraph (paragraph 
condition). They were told: “This time, I want you to find just one thing from 
the story. Where does a grizzly bear usually live?” The children were also told to 
underline the sentence that contained the requested information. To ensure that 
the children would remember their task, they were asked to repeat what they 
were searching for. The children then began the second task. The amount of time 
required to read the first paragraph of set 2 was measured. Since the first 
paragraph contained material on the black bear, this could be skipped if the 
children were skimming efficiently. When the children finished underlining the 
target sentence, they were asked how they read the story to find the answer. 


The experimenter then initiated the third task in which skimming for key 
words could locate the target information (keyword condition). The children were 
again told to “find just one thing: How cold is it in Vancouver in the winter?” 
Again they were told to underline the target sentence, to repeat what they were 
seeking, and then given set 3. The experimenter measured from the time they 
turned the first page until they began to underline the sentence. They were then 
asked how they read the material to find the answer. 


Phase 2. Immediately following the completion of the three behavioural tasks, 
the children were interviewed with the two major sets of questions. These items 
were presented in the same order (as seen in Results and Discussion) for all 
children. The experimenter emphasized that there were no right or wrong 
answers. If a child was unable to answer a question, it was repeated or rephrased 
until some answer was given. The entire interview session was tape-recorded. 

Phase 3. Finally, the experimenter told the children how to use key words to 
locate information quickly (explicit condition). The children were asked to find 
“just one thing” from set 4: “How cold is it in New York City during the winter?” 
They were told to find the answer “as quickly as you can by looking for such 
words as temperature, a number, or the degree symbol.” As in the other 
behavioural tasks in Phase 1, measures were taken from the time the children 
turned the first page until they began to underline their answer. When the 
children were finished with the final behavioural task, they were thanked for 
their participation in the study and then sent back to their classroom. 


Scoring 

Each child’s verbal responses were transcribed into a written account from 
the tape-recorded interview. Lengthy responses were reduced to short summaries 
that were semantically equivalent to the original reports. Several categories for 
each interview item were then generated that could be used to classify the 
children’s responses. Based on these categories, two judges independently scored 
the responses for each of the protocols. Interjudge reliability ranged from 93 to 
100%. Scoring disagreements between the judges were resolved through mutual 
agreement. 
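
The report does not state how interjudge reliability was computed. A minimal sketch, assuming it was simple percent agreement between the two judges on the category assigned to each response, is given below. The function name and the example codes are hypothetical and are not taken from the study's protocols.

```python
def percent_agreement(codes_a, codes_b):
    """Percentage of responses that two judges placed in the same category.

    Assumption: reliability is simple percent agreement; the source does
    not say which index was actually used.
    """
    if len(codes_a) != len(codes_b):
        raise ValueError("Both judges must score the same responses.")
    if not codes_a:
        raise ValueError("At least one scored response is required.")
    matches = sum(a == b for a, b in zip(codes_a, codes_b))
    return 100.0 * matches / len(codes_a)


# Hypothetical category codes for five responses to one interview item.
judge_1 = ["specific", "specific", "other", "nonspecific", "other"]
judge_2 = ["specific", "specific", "other", "other", "other"]

agreement = percent_agreement(judge_1, judge_2)
print(f"Agreement: {agreement:.0f}%")
```

Under this reading, the one response on which the example judges differ is the kind of case that would then be settled by discussion.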


Results and Discussion 


Behavioural Data 

The amount of time required by each child to read 121 words under each of 
the four conditions (whole story, paragraph structure, key word, and explicit) was 
obtained as behavioural measures. The time data obtained under each of these 
conditions are summarized in Table 1. A 3 (grade) x 4 (condition) analysis of 
variance performed on these data yielded significant main effects for grade, F 
(2,59) = 29.37, and for condition, F (3,177) = 152.07, p < .001. The different 
instruction conditions produced different effects on reading speeds at the three 
grade levels and, consequently, a significant grade x condition interaction was 
observed, F (6,177) = 6.22, p < .001. 
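
The degrees of freedom reported for these F ratios can be checked against the sample sizes given in the Subjects section. The sketch below assumes a mixed design, with grade as a between-subjects factor and condition as a within-subjects factor measured on every child. That reading of the design, and the function name, are assumptions and are not stated in the report.

```python
def mixed_anova_df(group_sizes, n_conditions):
    """Degrees of freedom for a groups x conditions mixed design.

    Assumes one between-subjects factor (groups) and one within-subjects
    factor (conditions), with every subject measured under every condition.
    Returns (numerator df, denominator df) for each effect.
    """
    n_subjects = sum(group_sizes)
    n_groups = len(group_sizes)
    df_group = n_groups - 1
    df_subjects_within_groups = n_subjects - n_groups
    df_condition = n_conditions - 1
    df_interaction = df_group * df_condition
    df_within_error = df_subjects_within_groups * df_condition
    return {
        "grade": (df_group, df_subjects_within_groups),
        "condition": (df_condition, df_within_error),
        "grade x condition": (df_interaction, df_within_error),
    }


# 24 grade 4, 24 grade 6, and 14 grade 8 children; four reading conditions.
dfs = mixed_anova_df([24, 24, 14], n_conditions=4)
for effect, (df_num, df_den) in dfs.items():
    print(f"{effect}: F({df_num},{df_den})")
```

With 62 children in three grades and four conditions, this gives (2,59), (3,177), and (6,177), which agrees with the values reported in the text.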


TABLE 1 


MEAN AMOUNT OF TIME SPENT ON THE PROSE PASSAGES 
UNDER THE FOUR CONDITIONS 


                               Condition 
             Whole       Paragraph      Key 
             Story       Structure      Word       Explicit 
Grade 4      55.10         48.17        53.43       13.42 
Grade 6      46.99         36.08        40.64       11.90 
Grade 8   [illegible]      18.59        20.81    [illegible] 


Note. Time given in seconds. 

Further analyses of the significant interaction in terms of Duncan’s multiple 
range test revealed: (a) The grade 4 children did not differ in reading speeds 
under the whole story, paragraph structure, and key word conditions (p > .05). 
They read the story significantly faster only under the explicit condition when 
compared to the whole story condition. (b) In contrast to the grade 4 children, 
the grade 6 children read the material faster under the paragraph structure 
condition than under the whole story condition. Although the mean reading 
speed for the key word condition differed only marginally from that for the 
whole story condition, this was mainly due to the fact that one child had an 
unusually long reading time (109.3 sec) under the key word condition. (c) At the 
grade 8 level, the children read the passages faster when they were told to find 
“just one thing” (i.e., paragraph structure and key word) than when they were 
told to read the “whole story,” p < .01. As can be seen in Table 1, these children 
also benefited from the explicit instruction to use key words. 

Also, the significant interaction of grade and condition indicated that the 
reading speeds did not differ across the three grades under the explicit condition, 
although the older children read the materials with significantly faster speeds 
than the younger children under the remaining three conditions, F (2,59) = 
22.74, 31.41, and 16.44, p < .001, for the whole story, paragraph structure, and 
key word conditions, respectively. It may be added that all of the children 
underlined target sentences correctly under all of the three task conditions, 
indicating that these children recalled the task demands while reading and 
verified the answer when it was found. 

The observed differences in reading speeds between the conditions of whole 
story and paragraph structure or key word, suggest that children, as they grow 
older, become increasingly more knowledgeable about when to skim. This 
observation is consistent with the previous researchers’ (e.g., Myers & Paris, 
1978) conclusion that grade 6 and older children have general knowledge that 
they should read stories differently depending on task goals. Many of the 
younger children failed to modify their reading speeds when asked to locate one 
specific piece of information. This may be partly because some of these children 
were insensitive to implicit task demands and partly because others did not know 
how to skim. It is clear, however, that even grade 4 children have basic skills to 
carry out skimming strategies required by the key word condition (data for the 
explicit condition). 


Verbal Data 

Paragraph structure. To investigate children’s knowledge about the use of 
the first sentence of a paragraph to search for information in a passage, a general 
question was asked first (item 1). It was general in that the word “paragraph” 
was not used in the question. 

1. “Suppose you want to teach one of your classmates how to find out from 
this story (referring to the story about bears) where a grizzly bear usually lives. 
How would you teach them to find the answer quickly?” Table 2 shows how 
children of different grade levels responded to this item. Responses were labelled 
“specific skimming” if they specified how another child could skim the story (e.g., 
“Tell him to skim for the word live.” “First, you look for the word grizzly. And 
when you find the part about the grizzly bear, then you look for the word live.”). 
Responses were categorized “nonspecific skimming” if they said that they would 
tell a classmate merely to “read fast” or to “skim for the answer.” Any response 
that did not indicate a method for finding the answer quickly was classified as 
“other forms of reading” (e.g., “Read everything in the story.” “Tell him to read 
the story carefully”). 


TABLE 2 


PROPORTION OF CHILDREN AT EACH GRADE LEVEL GIVING VARIOUS 
TYPES OF ANSWERS TO ITEM 1 (PARAGRAPH STRUCTURE) 


            Specific     Nonspecific    Other Forms 
            Skimming     Skimming       of Reading 
Grade 4       .50           .00            .50 
Grade 6       .54           .25            .21 
Grade 8       .93           .07            .00 


As can be seen in Table 2, one-half of the grades 4 and 6 children and 
virtually all of the grade 8 children specified how one may use skimming 
strategies to search for a piece of information in written materials. None of the 
children, however, explicitly mentioned how one may use the opening sentence of 
a paragraph to skim strategically. Six grade 8 children, but none of the grades 4 
or 6 children, mentioned a “two-stage” skimming strategy (first grizzly, then live), 
as is illustrated by the second example of the “specific” category. The data 
summarized in Table 2 were analyzed by forming three 2 (e.g., grade 4 vs. grade 
6) x 2 (specific skimming vs. combination of the remaining 2 categories) tables. 
The observed age differences between grades 4 vs. 8 and grades 6 vs. 8 were 
statistically reliable, χ² (1) = 5.44 and 4.47, respectively, p < .05. When the 
“specific” and “nonspecific” categories were combined, there was a trend for more 
of the grade 6 (79%) than the grade 4 (50%) children to say “skim” (p < .10), 
suggesting that more of the children in the middle than in the youngest age 
group had at least a general idea about what activity was required. In this 
further analysis, the difference between the grades 6 (79%) and 8 (100%) was 
not significant. 
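
The chi-square values for these four-fold tables can be approximated from the proportions in Table 2 and the group sizes. The sketch below rests on two assumptions that the text does not state: the cell counts are inferred by multiplying each proportion by its group size (24, 24, and 14 children), and the statistic includes Yates' continuity correction. The function name is illustrative only.

```python
def chi_square_2x2(a, b, c, d, yates=True):
    """Chi-square (1 df) for the four-fold table [[a, b], [c, d]].

    With yates=True, the continuity correction subtracts N/2 from
    |ad - bc| before squaring (never going below zero).
    """
    n = a + b + c + d
    diff = abs(a * d - b * c)
    if yates:
        diff = max(diff - n / 2, 0.0)
    denominator = (a + b) * (c + d) * (a + c) * (b + d)
    return n * diff ** 2 / denominator


# Inferred counts (assumption): specific skimming vs. the other two
# categories combined, from the Table 2 proportions and group sizes.
grade4 = (12, 12)  # .50 of 24 children
grade6 = (13, 11)  # .54 of 24 children
grade8 = (13, 1)   # .93 of 14 children

chi_4_vs_8 = chi_square_2x2(*grade4, *grade8)
chi_6_vs_8 = chi_square_2x2(*grade6, *grade8)
print(round(chi_4_vs_8, 2), round(chi_6_vs_8, 2))
```

Under these assumptions the two statistics round to 5.44 and 4.47, matching the values reported for the grade 4 vs. 8 and grade 6 vs. 8 comparisons. Without the correction the same counts give larger values, which is why the correction is assumed here.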

While none of the children explicitly referred to the use of paragraph 
structure for the purpose of efficient skimming, this may not necessarily mean 
that such knowledge is absent. The idea might not have occurred to them with 
the general question. In order to determine this possibility, the following two sets 
of questions (items 2 and 3) were asked. Item 2 was included to introduce item 3 
to children. 


2. “Some students tell me that when they are looking for just one thing in a 
story, they do not read all of the paragraphs. They say that they skip some of 
the paragraphs. Do you ever read this way when you are looking for a piece of 
information?” Fifty-four percent of the grade 4, 71% of the grade 6, and 93% of 
the grade 8 children indicated that they sometimes skip paragraphs. The analyses 
of the data showed that only the difference between the grades 4 and 8 was 
significant, χ² (1) = 4.47, p < .05, revealing that more of the grade 8 than grade 
4 children affirmed the use of skimming strategies. 


3. “How do you (or these students) find out which paragraphs you (or they) 
can skip? How do you (or these students) find out which paragraph you (or they) 
should read?” With this item, 37% of the grade 4, 50% of the grade 6, and 80% 
of the grade 8 children said that they would decide whether or not to read the 
whole paragraph by reading the first few sentences of the paragraph (e.g., “I 
would read the first sentence.” “I would read the first sentence and then skim to 
see if there were any words related to my topic in it.”). The analyses of the data 
by forming the four-fold contingency tables (e.g., grade 4 vs. grade 6 x paragraph 
structure vs. other responses) indicated that significantly more of the grade 8 
than grade 4 children referred to the use of a paragraph structure for deciding 
the relevancy of a particular paragraph, χ² (1) = 6.48, p < .02. The difference 
between the grades 8 and 6 was marginally significant (p < .10), while the 
difference between the grades 4 and 6 was not. 


Does the age-related change observed in the children’s responses to item 3 
reflect the possibility that more of the older than the younger children were 
aware of the function of the opening sentence of a paragraph? To examine this 
question, the children were presented with item 4. 


4. “What information does the first sentence of a paragraph usually tell you?” 
Recently, Myers and Paris (1978) concluded that a majority of grade 6 children 
are aware of the function of the first sentence of a paragraph. This conclusion 
may be extended to even younger children. Thus, 71% of the grade 4, 79% of 
the grade 6, and 93% of the grade 8 children said that the first sentence 
indicates what the content of a paragraph will be (“They tell you what you'll find 
out in that paragraph.” “They tell you what the paragraph’s going to be about.”). 
None of the comparisons between different grade levels was statistically 
significant. The findings for items 3 and 4 suggest that both younger and older 
children are equally aware of the function of the first sentence of a paragraph, 
but older children are more likely than younger children to apply this knowledge 
to situations of information search. Among those children who knew the function 
of the first sentence (item 4), 41% of the grade 4 children, 63% of the grade 6 
children, and 85% of the grade 8 children were able to provide adequate 
responses to item 3. 

Key words. To investigate children’s knowledge about how to use key words 
to locate a piece of information in a passage, item 5 was prepared. 

5. “When you look for the temperature of Vancouver during the winter, is 
there any way you can find this information right away?” Table 3 summarizes 
how children responded to item 5. Responses were classified as specific skimming 
if the children said that they would skim for specific key words or symbols to 
find the answer quickly (e.g., “Skim the story for a number and the degree 
symbol.” “Look over the story quickly for the word temperature.”). Responses 
were scored as nonspecific skimming if the children merely said they would “read 
fast.” Responses that did not indicate a method for locating the answer quickly 
were classified under “others” (“I just read the story.” “I read everything in the 
story.”). As can be seen in this table, the proportions of the children in the 
specific skimming category increase as a function of age. The analyses of the data 
indicated that only the difference between grades 8 and 4 was significant, χ² (1) 
= 4.47, p < .05. 


TABLE 3 


PROPORTIONS OF CHILDREN AT EACH GRADE LEVEL 
GIVING VARIOUS TYPES OF ANSWERS TO ITEM 5 (KEY WORD) 


            Specific     Nonspecific    Other Forms 
            Skimming     Skimming       of Reading 
Grade 4       .54           .00            .46 
Grade 6       .79           .08            .13 
Grade 8       .93           .07            .00 


About one-half of the grade 4 and one-fifth of the grade 6 children did not 
refer to the use of key words spontaneously in response to interview item 5. This 
finding may be due to the inability of these children to anticipate the form in 
which the answer may be expressed. The children’s responses to the following 
question (item 6) indicate that this was not the case for the majority of the 
children. 

6. “What might give you a clue where this information (temperature of 
Vancouver) could be located? What other clues might help you to find this 
information quickly?” When only the first responses given to item 6 were 
considered, 83% of the grade 4, 100% of the grade 6, and 93% of the grade 8 
children were able to give a good key word (e.g., temperature, degree symbol, 
number). A comparison of “presence of a good key word” and “absence of a good 
key word” across grades yielded a nonsignificant result, indicating that children 
at all three grade levels are equally likely to think of a good key word for this 
question. 


The children were encouraged to think about as many “good clues” as they 
could. The mean numbers of good clues generated were 2.2, 2.9, and 2.9 for grades 
4, 6, and 8, respectively. These scores were not statistically different. The data 
for item 6, therefore, suggest that after grade 4 there may not be age-related 
differences in children’s ability to anticipate how the information may be 
expressed in a passage if one uses tasks similar to the present one. 


Of those children who could anticipate how the sought-for information might 
be expressed in a passage, 65% of the grade 4, 79% of the grade 6, and 100% of 
the grade 8 children used this knowledge to provide the specific skimming 
responses to item 5. This developmental trend is quite comparable to that just 
described for items 3 and 4 (see the section for item 4). 


Conclusion 


Based on the results, what can be said regarding the development of 
children’s knowledge about skimming strategies? The discussion will be focused 
on this question. 

The use of the strategies considered in this study requires several pieces of 
knowledge and cognitive skills. First, a child must know (a) what the opening 
sentence generally does for a paragraph and (b) how the sought-after information 
may be expressed in a passage. The verbal data indicate that a majority of grade 
4 children have such basic knowledge (data for items 4 and 6). Second, a child 
must keep in mind what he/she is looking for while looking swiftly over the page 
and evaluate the answer when it is found (Spache & Spache, 1977). The present 
study shows that grade 4 children have such skills necessary for performing these 
skimming strategies, as all of the grade 4 children correctly underlined 
task-relevant sentences under all of the search conditions. In addition, these 
children located a specific piece of information as quickly as the grade 8 children 
when they were explicitly instructed to skim for key words. It appears, then, that 
many children will acquire the basic knowledge and cognitive skills investigated 
here by the time they finish grade 4. 

There are some aspects of knowledge about skimming that show developmental 
changes. Researchers of cognitive development frequently have pointed out that 
children acquire knowledge about learning materials and skills, but only at a 
later point of development do they become aware they can make use of such 
previously acquired knowledge to solve new or different problems (e.g., Brown, 
1975; Flavell, 1970). In an analogous manner, we have found in the verbal data 
that as many grade 4 as grade 8 children were aware that the target information 
for “How cold is it in Vancouver?” was related in some way to a degree symbol 
and a numerical figure. However, only half of the grade 4 but all of the grade 8 
children were able to think that they could facilitate the process of information 
search by using their knowledge about how information may be expressed (see 
Table 3). A similar pattern was observed in the data regarding the use of a 
paragraph structure (items 3 and 4). These findings suggest that children 
between grades 4 and 8 gradually become more sensitive to parameters affecting 
the process of information search in a written passage, forming knowledge about 
how to skim. 

An examination of both the verbal and behavioural data suggests that there is 
something else that may develop between grades 4 and 8. According to the verbal 
data, approximately one-half of the grade 4 children could describe skimming 
techniques explicitly (data for items 1 and 5). The behavioural data indicate, 
however, that only a small proportion of these knowledgeable children actually 
used such strategies when they were asked to find a single fact in a passage. This 
observed discrepancy between the verbal and behavioural data suggests that 
there were children at the grade 4 level who had the knowledge about how to 
skim, but were not sensitive to an implicit task demand to modify their reading 
strategies. In contrast, the grade 8 children not only articulated how to use 
skimming strategies (verbal data), but also spontaneously used the knowledge 
according to the requirements of the task. These data suggest that part of 
knowledge about skimming consists of knowing when one should deliberately use 
the knowledge of how to skim, which many grade 4 children may have to acquire. 


We have seen in the preceding paragraphs three sets of data showing 
different patterns of age-related changes in children’s performance. It can be seen 
that tasks corresponding to those data require different degrees of information 
integration. In relation to the question posed at the beginning of this section, 
these three sets of data converge to suggest that the development of knowledge 
about skimming is an increasing integration of various pieces of knowledge 
concerning, for example, learning materials, means-end relationships, and 
implications of specific task demands. 


Finally, the grade 8 children displayed mature performance throughout the 
present experimental session. It should not be concluded from these findings, 
however, that the development of knowledge about skimming and the use of that 
knowledge for information search is completed by the time children reach this 
grade. As already mentioned, to skim effectively presupposes that children know 
what is being sought. Whether or not students are able to determine the kind of 
information they are looking for depends partly on the nature of the retrieval 
questions and the reading materials. For example, children who are able to 
decide the kind of information needed to answer the questions used in this study 
(“How cold is it in Vancouver?”) may not necessarily be able to do the same for 
such questions as: “Why is Samuel de Champlain often called ‘the Father of 
Canada’?” From among a virtually infinite number of situations requiring 
locational skills, only two instances of locating by skimming were examined here 
using grade 4 level reading materials. While attempts were made to test the 
lower limits of children’s knowledge about skimming, attempts need to be made 
in the future to test the upper limits of such knowledge in older children. 


Notes 


1. The reading program of this particular school is similar to that practised at other schools in the Essex 
County School System of Ontario. Teachers use materials related to the Nelson Language Development 
Reading Program. At grade 6 and higher levels, this program is, to some extent, supplemented by SRA 
(Science Research Associates) studybooks. At the time this research was conducted, a lesson plan specifically 
intended to teach “when” and “how” to skim and the utility of such techniques was not part of the grade 4 
teachers’ curriculum. When grade 4 children use skimming, they do so primarily as responses to teachers’ 
incidental instructions. For example, having noticed that the work period would end soon, a grade 4 teacher 
said, “You are looking for the names of people. Try to look for capital letters.” Grade 4 children’s 
knowledge about skimming obtained in this study, therefore, probably reflects what they incidentally learned 
through classroom activities or what they constructed for the first time while responding to our interview 
questions. 

2. All Chi-squares obtained from four-fold contingency tables (e.g., grade 4 vs. grade 8 × skim vs. others) are 
Yates-corrected. 
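
As an illustration of the correction mentioned in this note, a Yates-corrected chi-square for a four-fold table can be computed as in the minimal Python sketch below. The function name and the cell counts are hypothetical and are not data from the study; clamping the corrected difference at zero is a common convention that is assumed here.

```python
def yates_chi_square(a, b, c, d):
    """Yates-corrected chi-square for a 2 x 2 table laid out as
        [[a, b],
         [c, d]]
    e.g., rows = grade 4 / grade 8, columns = skim / other.
    """
    n = a + b + c + d
    denominator = (a + b) * (c + d) * (a + c) * (b + d)
    if denominator == 0:
        raise ValueError("every row and column total must be non-zero")
    # Continuity correction: shrink |ad - bc| by n/2, never below zero
    # (the clamp is an assumed convention, not something stated in the note).
    corrected = max(0.0, abs(a * d - b * c) - n / 2)
    return n * corrected ** 2 / denominator


# Hypothetical counts, for illustration only.
example = yates_chi_square(6, 14, 15, 5)
```

The result has one degree of freedom and would be compared with the usual chi-square critical values.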


The paper is based on the second author’s M.A. thesis. The research was supported in part by Canada Council 
Grant S76-0573 to the first author. The authors wish to thank G. Ann Foster for her valuable suggestions for 
conducting this study and the teachers and children of the Essex County Separate School System for their fine 
cooperation. 
Changes in Performance of Children’s Handwriting 
as a Result of Varying Contingency Conditions 


Verbal praise and privileges administered under varying conditions of contingency and 
directness were investigated in a normal elementary classroom. Dependent variables were 
handwriting performance, sociogrammatic responses, task-ranking, comments and 
complaints. Data were collected in a multiple time-series design using contingent vs. 
noncontingent and direct vs. implicit reward conditions as the independent variables, 
and were analyzed by repeated measures analysis of variance using Scheffé contrasts as 
a post hoc method of isolating effects. Contingent rewards were more effective as 
reinforcers than noncontingent rewards, but implicit rewards possessed aversive qualities. 
Other dependent variable data were nonsignificant. This variability of rewards as 
reinforcers of handwriting supports suggestions made by Bandura (1969), Premack 
(1965), and Skinner (1938) regarding the situational nature of reinforcers. 


Bandura (1969) summarized the three essential factors of successful 
reinforcement procedures: (1) a reinforcer should be powerful enough to maintain 
responsiveness over time; (2) there must be a contingency between the 
reinforcing events and the desired responses; and (3) a reliable method of 
eliciting the desired response is necessary. Bandura further argued that 
reinforcement is relational rather than an absolute property of the consequence 
used. A particular response event may have lowered reinforcing potency in 
relation to a more highly preferred activity. Thus, rewards gain reinforcement 
qualities in a situational manner (see Premack, 1965). With reference to school 
based situations, it may be that a stimulus which has reinforcing properties for 
some children may not possess reinforcing properties for all children in a chosen 
class. 

In a group situation (e.g., a classroom), rewarding a subgroup for a particular 
activity would (according to vicarious reinforcement principles) result in an 
increase of that activity for those subjects who were rewarded, and also for those 
who observed their peers receive the reward and who judged that they (the 
observers) would later receive that reward for performing the particular activity 
themselves. However, Equity theorists (e.g., Adams, 1965; Adams & Freedman, 
1976) would predict that those not receiving the reward would not perform while 
those receiving would claim that the situation was not fair. 


When examined carefully, it appears that the vicarious reinforcement 
situation is dramatically altered in the normal classroom. In the typical subgroup 
reward situation in normal classrooms, all the members of the class are 
performing the designated task-activity, but only some of them receive rewards 
for performance. Bandura (1971) terms this “implicit reinforcement,” and notes 
that 

When the same deserving performances are praised in one case and ignored in the 
other, the slighted person is not only exposed to observed outcomes, but he 
experiences immediate direct consequences to his own behaviour, which can have 
rewarding, punishing, or extinctive effects. (p. 234) 

The lack of previous research regarding this issue has been highlighted by 
Bandura (1979) who has claimed that the “social interdependence of 
reinforcement effects is an important phenomenon that surprisingly has received 
no attention.” 


The present study was designed to examine the effects of changing the 
contingency conditions of a reward to determine whether this led to changes in 
subject-response. Specifically, contingent vs. noncontingent and direct vs. implicit 
reward conditions were implemented as independent variables. Handwriting was 
chosen as the principal dependent variable because its sensitivity to transient 
motivational states of subjects should reflect changes in subjects’ reactions to 
altered incentive conditions (Birge, 1954; Eysenck, 1948; Flunckiger, Tripp & 
Weinbeck, 1961). Sociogrammatic measurements were taken to detect changes in 
peer/target-subject social interactions. Task-ranking procedures have been used 
to measure the changes in subject-evaluation of an academic task (Foreman, 
1974), and this was included as a dependent variable. Finally, a record of 
subjects’ comments and complaints was kept by teachers to determine subjects’ 
reactions to changing contingency conditions. 
Method 


Subjects 


Subjects were 22 fifth grade girls (mean C.A. = 10.5) and 22 girls and 12 boys 
from a fourth grade (mean C.A. = 9.7) from a private school in a large metropolitan 
city (Brisbane in Australia). Teachers were not aware of the actual nature of the 
study and were told that several sentences were being trialled for measuring 
handwriting. 


Design 

A multiple reversal and time-series design was used (see Figure 1) as a 
method of avoiding sources of variability in subjects’ performances which were 
extraneous to the experimental variables (Campbell & Stanley, 1966). The 
descriptive system used represents Baseline, or nil intervention phases by “A”; 
intervention is represented by “B”; when intervention varies subscript numerals 
depict this, e.g., AB₁AB₂A. The use of repeated measurements of each subject’s 
responses over multiple interventions overcomes the problem of spuriously high 
levels of variability which is often encountered in experimental work in the 
behavioural sciences by using each subject as its own control (Winer, 1971). The 
use of repeated measurements of the chosen task-activity, plus the matching of 
the two classes over differing time periods contribute strongly to the validity of 
the study by reducing effects due to history, maturation, testing and selection. 
Randomization (after matching) of subjects into groups adds to the reliability of 
data by overcoming invalidity due to selection and statistical regression. The 
implementation of baseline phases before, between, and after intervention phases 
helps to overcome invalidity due to history, maturation and testing. There were 
no drop-outs from any subject-group, thus avoiding possible invalidity due to 
mortality, and the method of measurement remained constant throughout 
(reliability procedures are explained below), thus overcoming invalidity due to 
instrumental changes. 


Figure 1. Total intervention conditions (horizontal axis: weeks). Legend: B3 = direct contingent rewards; B (subscript illegible) = implicit contingent rewards. 
Procedure 

Subjects were tested for handwriting performance prior to the study, and 
were matched according to this in pairs and then randomly split into two groups 
of equal ability for each class. Each day during the study, every member of the 
class receiving the treatment performed the task activity during a normal 
classroom period between 9.00 a.m. and 10.30 a.m. All subjects were required to 
perform the task on a provided piece of appropriately lined paper approximately 
8 x 21 cm, and write their names on the reverse side of the paper. The teacher 
then collected all subjects’ papers while giving the appropriate intervention-phase 
response. Daily recording of subjects’ comments and complaints was carried out 
by the teacher. Sociogrammatic and task-rank measurements were carried out 
during the baseline periods and during each contingent reward period. 
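
The matching-then-randomizing step described above can be sketched in Python as follows. This is only one plausible reading of the procedure: the function name, the subject identifiers, the pretest scores, and the rule of pairing adjacent subjects in rank order are all assumptions for illustration, since the text does not specify how pairs were formed.

```python
import random


def matched_random_split(pretest_scores, seed=None):
    """Split subjects into two groups of matched ability.

    pretest_scores maps a subject id to a pretest handwriting score.
    Subjects are ranked by score, paired off in rank order (an assumed
    pairing rule), and the two members of each pair are assigned at
    random to different groups.
    """
    rng = random.Random(seed)
    ranked = sorted(pretest_scores, key=lambda s: pretest_scores[s], reverse=True)
    group_a, group_b = [], []
    # Walk the ranking two subjects at a time; each adjacent pair is a match.
    for i in range(0, len(ranked) - 1, 2):
        pair = [ranked[i], ranked[i + 1]]
        rng.shuffle(pair)
        group_a.append(pair[0])
        group_b.append(pair[1])
    # With an odd class size, the unpaired subject is placed at random.
    if len(ranked) % 2 == 1:
        rng.choice([group_a, group_b]).append(ranked[-1])
    return group_a, group_b


# Hypothetical pretest scores (letters correct out of 26) for six subjects.
scores = {"s1": 21, "s2": 14, "s3": 20, "s4": 9, "s5": 15, "s6": 10}
group_a, group_b = matched_random_split(scores, seed=1)
```

Because each matched pair contributes one subject to each group, the two groups end up with the same size and similar pretest distributions, whichever way the random assignment falls.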


Dependent variables 


1. Handwriting was evaluated each day by two measures: 

(a) a count of the number of letters written correctly according to the 
standard and style set down in the Queensland Department of Education 
Syllabus for Schools, Language Arts (the syllabus from which the children had 
been taught). Each letter was scored only on its first occurrence in the daily 
sentence for the particular day in question. The five daily sentences each contained 
every letter of the alphabet in lower case script at least once. Capitals were not 
assessed. The content of these sentences was devised by a panel of three teachers 
experienced with grade 4 and 5 children, to ensure that the levels of grammatic, 
spelling, and cognitive content were appropriate. The particular sentence for each 
day was presented to the class in the manner usual for writing exercises in that 
class. 

(b) a count of the number of errors in presentation. Errors were scored 
according to the following criteria: words added/omitted; letters added/omitted; 
misspelling; punctuation errors. 


2. Sociogrammatic status was measured weekly with the exception of weeks 2 
and 4 for Class 1, and 18 and 19 for Class 2 by asking each subject to complete a 
form which asked whom the subject would most like to sit next to, and whom 
the subject would least like to sit next to. Subjects’ own names were written on 
the bottom of the form. Acceptances and rejections were totalled for each group 
as a measure of group popularity. This measure was based on Cronbach’s (1970) 
suggestions for measuring sociogrammatic status. 

3. Task-ranking was evaluated at the same time as sociogrammatic status by 
the rank which each child awarded each of the curricular-areas (including the 
task activity) on a voting sheet as a measure of the evaluation of that activity 
(after Foreman, 1974). Each subject voted for the total of nine curricular-areas 
on a scale where 1 represented the most-liked choice and 9 represented the 
least-liked choice. The status of handwriting was calculated by subtracting the 
actual vote preference given by a child from the total number of choices available 
and using the answer obtained as that child’s vote. Totals for groups were then 
calculated. 


4. Comments and complaints were measured by the teacher entering a record 
of unsolicited comments and complaints each day onto a daily score sheet. Where 
possible, note was made of the actual wording of the comment and/or complaint. 
Totals were calculated across groups. Teachers were requested to score as a 
complaint any verbalization which was of a derogatory nature concerning the 
task-exercise, the teacher, the rest of the class, or the reward-structure. 
Verbalization of a nonderogatory nature was to be recorded as a comment. Both 
comments and complaints were recorded at any time during the entire school 
day. 
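
The task-ranking score described under item 3 above amounts to simple arithmetic, sketched here in Python. The function names and the example ranks are hypothetical; only the nine curricular areas and the subtraction rule come from the text.

```python
N_AREAS = 9  # nine curricular areas appeared on the voting sheet


def handwriting_vote(rank, n_areas=N_AREAS):
    """Convert a child's rank for handwriting into a vote.

    rank 1 is the most-liked area and rank n_areas the least liked; the
    vote is the number of choices available minus the rank given.
    """
    if not 1 <= rank <= n_areas:
        raise ValueError("rank must lie between 1 and n_areas")
    return n_areas - rank


def group_status(ranks):
    """Total the converted votes for one group of children."""
    return sum(handwriting_vote(r) for r in ranks)


# Hypothetical ranks given to handwriting by four children in one group.
hypothetical_ranks = [1, 4, 9, 2]
total = group_status(hypothetical_ranks)  # votes 8, 5, 0 and 7
```

Under this conversion a higher group total means that handwriting was, on the whole, ranked nearer the top of the children's preferences.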

Independent variables 


Verbal approval and tangible rewards were given to the individual subjects of 
the target group in the form of the teacher’s comment: “That’s good writing, X 
(subject’s name). Take a house-point.”¹ The alternative subject-group received no 
approval or points, the teacher’s only comment to them being “Thank you” as 
the daily response sheets were collected. Details of these procedures are described 
below. The choice of the reward of teacher approval and house-points was based 
on the previous successful use of these rewards in these classes to reinforce 
mathematics skills activities. Used within an AB design over 12 daily trials 
(baseline = 6 days, rewards = 6 days), these rewards effectively reinforced 
mental arithmetic problem-solving (baseline $\bar{X}$ = 7.66, reward $\bar{X}$ = 15.16, t = 5.138, df = 10, 
p < .01). 

Experimental phases 

(a) Noncontingent reward: 

Baseline 1. Handwriting behaviour was recorded for each subject for five 
days with the teacher’s only comment being “Thank you” as the separate pieces 
of writing paper were collected. Any questions were politely answered in a 
noncommittal fashion by the teacher. 

Intervention 1 (Noncontingent rewards to groups 1a/2a). During this phase 
the teacher responded to group 1a/2a subjects individually by picking up each 
subject’s writing sheet, scanning it for approximately three seconds, and saying 
the standard reward sentences mentioned above. For group 1b/2b, the teacher 
picked up each subject’s writing sheet, scanned it for approximately three 
seconds and said “Thank you.” The teacher kept all writing sheets. 

Baseline 2. Conditions during this phase were as in Baseline 1. 

Intervention 2 (Noncontingent rewards to groups 1b/2b). During this phase, 
the conditions of Intervention 1 were enacted, but with the groups reversed. That 
is, group 1b/2b received the reward regardless of performance, and group 1a/2a 
received no reward. 

Baseline 3. Conditions during this phase were as in Baseline 1. 


(b) Contingent reward: 

Intervention 3 (Direct rewards to groups 1a + 1b/2a + 2b). On the first 
day of this phase, each teacher told her class that she wanted them to try the 
writing exercises again, and re-presented the daily sentences as before. Upon 
completion of each day’s task, the teacher said: “I don’t have time to correct 
these now, but I’ll give them back to you tomorrow with the results on them,” 
and then collected each subject’s task-sheet, saying only “Thank you” as she did 
so. The next day, immediately prior to asking the children to write the daily 
sentence on their sheets, the teacher returned each subject’s sheet from the 
previous day with the subject’s score of letters correct written on it. The teacher 
explained that if the subject had improved over the previous day’s performance, 
there would also be a “plus” on the sheet; if the score was equal to that of the 
previous day, there would be an “equals” sign; if less than on the previous day, 
there would be a “minus” sign on the sheet. The teacher handed out each 
subject’s sheet individually and commented to those who had scored a plus: 
“That’s good writing, X. Take a house-point.” Perfect scores (26/26) were always 
awarded a plus and the reward. These sheets were collected after a few minutes’ 
discussion, and the teacher then instructed the class to complete the day’s 
exercise which she then collected. 

Intervention 4 (Direct rewards to groups 1a/2a: Implicit rewards to groups 
1b/2b). During this phase, the teacher rewarded only groups 1a/2a, contingent on 
their results as in Intervention 3. Groups 1b/2b were not rewarded, but the 
teacher explained: “I’ve only got time to correct some of your writing for a few 
nights because I’m so busy, but I want you all to do as well as you can. I don’t 
know which ones I’ll choose to correct each night, so you all had better try hard.” 
Only groups 1a/2a had their previous day’s task sheets returned the next day, 
even though all subjects were given to expect that they might be the ones to 
receive the reward on any day. 

Intervention 5 (Direct rewards to groups 1b + 2b: Implicit rewards to 
groups 1a and 2a). During this phase, Intervention 4 was replicated with groups 
1b and 2b receiving the reward. 

Intervention 6 (Contingent rewards to groups 1a + 1b/2a + 2b). This was 
a return to Intervention 3 conditions. 

Baseline 4. This was a return to Baseline 1 conditions. 
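
The feedback rule used in the contingent phases (described under Intervention 3) can be written out as a short Python sketch. The function names are hypothetical; the rule itself follows the description in the text, including the provision that a perfect score of 26/26 always earned a plus.

```python
MAX_SCORE = 26  # one point per lower-case letter of the alphabet


def feedback_sign(today, previous, max_score=MAX_SCORE):
    """Return the mark written on a returned sheet.

    today and previous are letters-correct scores for consecutive days.
    """
    # A perfect score was always awarded a plus, even without improvement.
    if today == max_score or today > previous:
        return "plus"
    if today == previous:
        return "equals"
    return "minus"


def earns_reward(today, previous):
    """Praise and a house-point went only to sheets marked with a plus."""
    return feedback_sign(today, previous) == "plus"
```

For example, a child moving from 18 to 20 letters correct would receive a plus and the reward, whereas a child staying at 18 would receive an equals sign and no reward.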


Reliability measures 


Hall (1970) suggests that direct measurement of data which may be 
re-assessed at any time has the advantages of precision and permanency over 
many observer-recorded forms of data. Daily measures were collected of each 
subject’s performance on prepared slips of paper and retained for later 
cross-checking and re-sorting. 


The correction of the daily activity-task was carried out each day after school 
hours by the researcher with an independent evaluation on at least one day of 
each week. All corrections were performed by qualified teachers experienced at 
the grade 4 and 5 level. The independent evaluation was “blind,” i.e., previous 
subject scores and the researcher’s evaluation were unknown. The independent 
evaluator was also ignorant of the intervention procedures and the groups to 
which subjects belonged. The choice of which day was to be the checking day 
was performed by a third party on a random-choice basis. There was no feedback 
to either the researcher or the independent evaluator as to the respective scoring 
during the study. Inter-rater reliability was calculated and data appear below. 


At the end of the study the researcher performed a posttest rescoring on one 
day’s activities from each of the preceding weeks. Conditions regarding reliability 
were as for inter-rater evaluation procedures. Correlations for both inter-rater 
and posttest rescoring reliability were calculated by use of Pearson’s Product 
Moment Correlation. 
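
For readers unfamiliar with the statistic, the product-moment correlation between two sets of scores for the same sheets can be computed as in the following Python sketch. The scores shown are invented for illustration and are not taken from the study.

```python
import math


def pearson_r(x, y):
    """Pearson product-moment correlation between two equal-length lists."""
    if len(x) != len(y) or len(x) < 2:
        raise ValueError("need two lists of equal length, at least 2 items")
    n = len(x)
    mean_x = sum(x) / n
    mean_y = sum(y) / n
    # Sum of cross-products and sums of squares of deviations from the means.
    cross = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    ss_x = sum((a - mean_x) ** 2 for a in x)
    ss_y = sum((b - mean_y) ** 2 for b in y)
    if ss_x == 0 or ss_y == 0:
        raise ValueError("correlation is undefined when a list is constant")
    return cross / math.sqrt(ss_x * ss_y)


# Hypothetical letters-correct scores for five sheets from two scorers.
researcher_scores = [18, 22, 15, 26, 20]
independent_scores = [17, 22, 16, 26, 19]
r = pearson_r(researcher_scores, independent_scores)
```

A coefficient close to 1 indicates that the two scorers ordered and spaced the sheets in nearly the same way.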
Results 


Reliability 

Reliability checks were conducted on days 3, 7, 8, 13, 18, 19, 23, 56, 58, 63, 
79, 82, 96, 94, 98, 102 for groups 1a and 1b, and on days 84, 87, 91, 100, 103, 109, 
111, 118, 124, 130, 134, 186 for groups 2a and 2b by inter-evaluator methods. 
Post-checks were conducted for groups 1a and 1b on days 3, 8, 13, 18, 23, 58, 63, 
78, 83, 88, 94, 100, 103, and on days 83, 89, 94, 99, 104, 109, 114, 119, 121, 126, 
134, 136 for groups 2a and 2b. Inter-rater reliability ranged from .94 to .99 (1a 
and 1b) and from .98 to .99 (2a and 2b). 


Data from the Study 


The total data collected during the 29 weeks of the study are shown in 
Figure 2, from which a lack of apparent change in the levels of target-activity 
response during weeks 1 to 5 (groups 1a and 1b) and weeks 17 to 21 (groups 2a 
and 2b) is evident. Marked variations in response levels for the target-activity do 
appear in the weeks following this initial noncontingent phase in all groups. It 
would appear that response levels for the target-activity rose when rewards were 
direct and contingent for both groups, but fell at all other phases. Statistical 
analysis of this apparent change in response level is discussed below. 


Figure 2. Number of letters correct and errors in presentation for all groups (horizontal axis: weeks). 

The use of teacher-praise and house-points as contingent rewards proved 
effective in changing handwriting behaviour with the present sample of children. 
Change in response on the dependent variable of letters correct over 
interventions ($F_{1a+1b}(12, 240) = 155.4$, $p < .01$; $F_{2a+2b}(11, 352) = 135.0$, $p < .01$) 
points to a significant change in response when the independent variable of 
contingency was altered. 

“Between subjects” effects were not significant. That is, within each class, 
both groups reacted similarly to intervention. 

Data collected on the rate of errors in presentation indicate a significant 
difference over interventions for all groups ($F_{1a+1b}(12, 240) = 8.10$, $p < .01$; 
$F_{2a+2b}(11, 352)$, value illegible in the source). 

Subject-rank responses and comments and complaints remained virtually 
unchanged during the study, but sociogram responses fluctuated, though in no 
readily apparent order. Statistical analyses of these variables revealed that none 
of them was significant. 


Contingent vs. Noncontingent Rewards 


(a) Letters correct. The hypothesized relationship between contingent and 
noncontingent reward conditions was verified by Scheffé contrasts on the data 
(contrasts 1a vs. 1b = 18.35, p < .01; 2a vs. 2b = 15.89, p < .01), indicating 
that contingent rewards were more effective in producing correct handwriting 
behaviour than noncontingent rewards, for all groups. 


(b) Errors in presentation. Scheffé contrasts revealed that conflicting results 
were found for the two classes (contrast 1a vs. 1b = 2.42, n.s.; 2a vs. 2b = -3.57, 
p < .01). Class II (groups 2a and 2b) data supported the findings that errors 
were more common when rewards were noncontingent upon performance. Data 
from Class I (groups 1a and 1b) gave no support to any part of the earlier results 
with regard to errors in presentation. The apparently contradictory nature of 
these data may be partly accounted for by reference to the numbers of errors 
recorded for each group, which shows that the actual occurrence of errors was 
very low over all phases of the study (range -.16 to 1.44). 


Direct vs. Implicit Rewards 


(a) Letters correct. Concerning direct vs. implicit rewards, Scheffé contrasts 
showed that direct contingent rewards produced significantly higher levels of 
correct handwriting responses than implicit contingent rewards (contrast 1a vs. 
1b = 10.38, p < .01; 2a vs. 2b = 18.26, p < .01). 

From Figure 2 it is seen that, during weeks 17 (1a + 1b) and 24 (2a + 2b), 
one group in each class received direct rewards as it had during the previous two 
(2a) or three (1a) weeks of intervention, whilst the other group in each class (1b, 
2b) received implicit rewards. Both groups in each class declined equally during 
this week [group means for the previous week and for this week, respectively: 
Class I: 1a, 18.37 and 11.61; 1b, 19.02 and 12.56. Class II: 2a, 14.59 and 7.82; 
2b, 17.58 and 9.29. $F_{1a+1b}(1, 20) = .43$, n.s.; $F_{2a+2b}(1, 32) = 1.87$, n.s.]. 
Similarly, during weeks 18 (1a + 1b) and 25 (2a + 2b), the alternative group 
(1b, 2b) received direct rewards as it had prior to the week immediately previous, 
whilst the other group (1a, 2a) received implicit rewards. Both groups again 
declined equally as during the previous week [Class I: 1a, 11.61 and 5.60; 1b, 
12.56 and 8.84. Class II: 2a, 7.82 and 4.12; 2b, 9.29 and 5.36]. There were some 
individual scores 
at variance to these group scores, but only 5 out of 56 subjects responded 
differently to the group norm. In itself this overall conformity in terms of 
responses argues for the presence of powerful treatment effects. 


(b) Errors in presentation. Data collected on the rate of errors in 
presentation revealed conflicting results once again for the two classes (1a vs. 1b 
= 3.21, p < .01; 2a vs. 2b = -2.31, n.s.). As mentioned above, the apparent 
contradictory and inconclusive nature of these results regarding errors in 
presentation may be due to the actual occurrence of errors being very low over 
all phases of the study. 


Discussion 


The data from the study have supported the previous literature in relation 
to the issue of contingent vs. noncontingent rewards. Verbal praise and 
house-points proved effective reinforcers of correct handwriting responses in two 
normal elementary school classrooms. Noncontingent rewards were ineffective as 
reinforcers of the same task activity. 


With regard to the issue of direct vs. implicit rewards, data from this study 
demonstrated that, when implicit rewards were applied to one group, the 
performance of that group declined. However, the performance of the peer group 
also declined during this phase, although it received (hitherto reinforcing) direct 
rewards. Following this, direct application to both groups reproduced earlier 
reinforcing effects on all subjects. It requires some explanation that this reward 
could: (1) be chosen because of its effectiveness as a reinforcer in a similar 
situation (arithmetic skills, general classroom behaviour); (2) prove to be an 
effective positive reinforcer on the performance of a chosen academic-skills task 
(handwriting); (3) be shown to possess aversive properties when used as an 
implicit reward; and then (4) be once more an effective reinforcer for the same 
academic-skills task. 

It may be the observation of the presence of implicit rewards by both those 
subjects who are receiving them (and also by those subjects who are receiving 
direct rewards) which is responsible for the alteration in reward as reinforcer 
effectiveness. That is, rather than the actual reward itself, the perceived nature 
of the reward condition appears to be vital, with some kind of cognitive 
evaluation of the reward conditions by the subjects. This evaluation of reward 
conditions was followed by a controlled change in task-response by the subjects. 
The reward may have lost its reinforcement effectiveness and gained an aversive 
effect as subjects perceived inequalities in the application of the reward. That is, 
“unfair” reward conditions may have been observed by the subjects and a 
consequential lowering of reward-value followed. As the value of the reward was 
lowered by the subjects, so their performance deteriorated. 


Bandura (1969) suggested the effect of “central processes in behavioral 
inhibition” (p. 304) to account for the phenomena of vicarious punishment. The 
performance decrement displayed by all subjects in the present study suggests 
these processes as a variable which needs to be considered in the construction of 
response-reward learning situations. Skinner (1950) argued that 
subject-evaluation of the reward is at least partially responsible for the 
effectiveness of the reward as a reinforcer. Premack’s (1965) claim that rewards 
gain reinforcement qualities in a situational manner appears to be verified. As 
summarized by Bandura (1969) 

Reinforcement is a relational rather than an absolute property of the activity. A 
particular response event will have no reinforcing potency in relation to a more 
highly preferred activity. (p. 222) 

In the present study, the response event of verbal praise plus house/class 
points lost its reinforcing potency in relation to the more highly preferred 
activity of being fair to one’s classroom peers. Subject withdrawal from the task 
activity of handwriting may be seen as a means of restoring this fairness. 


Conclusion 


The present study demonstrated that contingent rewards were more effective 
as reinforcers than were noncontingent rewards in changing the handwriting 
behaviour of normal primary school children. The effectiveness of direct 
contingent rewards for changing this behaviour was also clearly demonstrated. 
The aversive effects of implicit contingent rewards were unexpected, and 
constitute a major finding herein. The actual nature of subjects’ cognitive 
processes as regards their evaluation of implicit reward conditions was not 
clarified by the data obtained from sociogrammatic and task-evaluation measures. 
Further research is necessary to elucidate this issue. 


The present study has replicated and verified earlier research which 
demonstrated the ineffectiveness of noncontingent rewards. Teachers (or school 
psychologists, counselors, etc.) who do not evaluate and feed back results of set 
tasks to pupils run the risk of aversive outcomes. Secondly, the use of operant 
techniques with a subgroup of a class (e.g., slow learners, hyperactive children, 
behaviour-problem children) to reward a behaviour which is being performed by 
the rest of the class, may have detrimental effects upon the level of that 
behaviour as performed by the whole class. 


Notes 


1. Each child in the school was allocated to one “House” on the basis of intramural groupings. House-points were 
totalled at the end of each school term and the winning house members awarded special privileges of their choice. 


PERSPECTIVES 


Educational Consultation: 
A Summary of Four Alberta Studies 


Edward A. Holdaway 
The University of Alberta 


and 


Ross H. Millikan 
The University of Melbourne 


The provision of consultative assistance is one of the five generally recognized 
functions of supervision, along with staffing, motivating, developing programs, 
and evaluating. However, as compared with these other four functions, the 
consultative function has received surprisingly little attention in research 
literature. Educational consultation, being a multi-faceted activity, can involve 
complex interactions between the consulter and the consulted. Blocher (1975), 
while emphasizing an empirical approach to consultative practices, outlines the 
complexity of the consultant’s role: 


If there is any area of general agreement in the literature on consultation, it is that 
consultants must proceed in careful, planned ways to generate data upon which to 
base strategies and decisions. They must attend to complex personal and social 
dynamics, and use knowledge about individual, group and organizational change 
processes in helping client systems to manage stresses, make decisions, and solve 
problems in effective and humane ways. (p. 171) 


Such complexity has been partly responsible for a feeling of concern among 
some professors and school district supervisors. One stated purpose of 
consultation is the improvement of instruction—the frequent lack of 
improvement following consultative interaction has reinforced this feeling of 
concern. To some extent this lack of improvement may result from inadequate 
knowledge of the consultative process which occurs in schools. 


Traditionally, this process has been viewed as one in which an “expert” 
assesses the level of competence of a teacher and suggests behaviours which 
could lead to improvement. This expert is usually a person occupying a formally 
designated position such as a coordinator or consultant (mainly “staff” positions) 
as compared with a principal or assistant superintendent or director (mainly 
“line” positions). Involvement of a staff officer, who is essentially a giver of 
advice, should lead to interaction with teachers and exchange of information in a 
predominantly neutral, helpful and non-threatening setting. The formal line 
evaluator position carries the explicit responsibility of evaluation of the 
supervised teacher. Partly because of perceptions held by both parties, no 
supervisor, whether staff or line, can completely remove evaluation from the 
provision of assistance to a teacher. Such supervisory evaluations about a 
subordinate, whether intentional or unintentional, formative or summative, may 
precede and/or follow any interaction. From the teacher’s standpoint, such 
evaluations are potentially negative, especially where the supervisor determines 
the teacher’s need for consultative assistance. 

A different way of viewing the consultative process is to consider it as one 
initiated by a teacher who identifies an aspect which causes sufficient concern to 
generate the asking for advice. In this context, the “consultant” is anyone who 
interacts with classroom teachers for the purpose of providing consultative 
assistance in response to teacher requests for advice or help in relation to 
perceived educational concerns and problems. That is, the classification of 
consultant is not confined to formally designated supervisory personnel. The 
underlying premise of person-centred supervision is that each individual is 
capable of growth and desires to mature professionally. This positive supervisory 
stance and optimism about personal motivation and desire for growth are 
expressed in McGregor’s “Theory Y” (Hicks & Gullet, 1975). 

This second view was the one used in the four studies described in this 
article. One of the authors who supervised all four studies (Holdaway) assumed 
that changes in behaviour of teachers are more likely to result when the need is 
recognized by teachers, although this assumption was not tested. This does not 
deny that supervisory personnel can identify certain behaviours which should be 
altered in order to improve the quality of instruction. 


These four studies were conducted by staff and students of the Department 
of Educational Administration at The University of Alberta in the 1970’s. The 
objectives were to gain information about the consultative needs and practices of 
teachers at different grade levels in Alberta, and to make relevant 
recommendations for research and practice. 


Relevant Literature 


Many textbooks and periodicals concerned with educational supervision have 
treated the topic at considerable length, especially from the perspective of 
experts providing assistance to teachers based on the experts’ assessments. Less 
literature is available on the matter of peer consultation. 


One example of advocacy of peer consultation is provided by Parsons (1971), 
who reached this conclusion on the basis of his experience and research: 


In view of the effectiveness of “other teachers” in helping staff members, it is 
recommended that greater opportunities be provided for teacher interaction in school 
systems by restructuring of teacher roles. Teachers should be given greater freedom 
from “in-class” responsibilities to share new ideas and techniques with their 
colleagues. (p. 8) 


Blumberg (1974) adds weight to Parsons’ recommendation in stating: 


[There is] a growing emphasis on teachers serving as supervisory-helping agents for 
one another in a systematic fashion. Teachers have long given help to their 
colleagues in an informal manner. The difference now is that peer-oriented 
supervisory structures are being built into the formal system. (p. 151) 


In similar vein, Alfonso (1977, pp. 594-601) considers that a new sense of 
teacher responsibility is imminent when they voluntarily exchange aid and advice 
with their colleagues in the interests of improved personal instruction. He 
believes that peer supervision has the potential to bridge the gap between 
existing sporadic superordinate supervisory efforts and the improvement of 
classroom teaching. The twin aspects of effectiveness and efficiency of peer 
supervision are dealt with by Sergiovanni (1975) who, in summing up much of 
the recent writing on supervision, states: 


Supervision is a neglected art in need of revival. . . . Improvements in education will 
increasingly depend upon more fully using and improving the human resources 
presently available to schools rather than upon introducing substantial new human, 
material, and economic resources. (p. 1) 


Lippitt and Lippitt (1978, p. ix) present the same view: “the function of 
consultation is part of the role and function of all those who lead, direct, teach 
or interact as friends or peers with others.” They further comment (Lippitt & 
Lippitt, 1978, p. 1) that “the role of a growing number of persons in our society 
is labeled consultant to describe their helping functions.” 


Because of the breadth of possible interactions which can legitimately be 
classified as consultation, ranging from formal supervision to spontaneous 
advice-giving, the confining of consultation to holders of particular positions is 
difficult. In addition, difficulty can be experienced in endeavouring to determine 
whether a consultative interaction is formal or informal. 


Increased recognition and use of peer consultation is also justified because 
consultation, regardless of the work setting, is firmly based on the helping 
relationship. Rogers (1961, pp. 39-40), one of the early writers in this field, 
describes the helping relationship as one “in which at least one of the parties has 
the intent of promoting the growth, development, maturity, improved functioning, 
[and] improved coping with life of the other.” Brammer (1977), in focussing more 
specifically upon the roles of both the “helper” and the “helpee,” as well as upon 
the intended goals of the interaction, states: 


Broadly speaking, “helping” means assisting other persons to reach goals that are 
important to them. In the context of the specialist helper, it means facilitating 
personal growth in the direction chosen by the person being helped, as well as 
toward the helper’s own model of an effective person. “Meeting the other’s needs” is 
a common descriptive phrase, but the real goal of all helping is self-help, which 
means teaching people to meet needs through their own efforts. (p. 303) 


Carkhuff (1969a and 1969b) adds that people continually discuss their 
problems, large and small, significant and trivial, with colleagues and peers, 
friends and acquaintances, and superordinates and subordinates. The choice of 
helper and mode of interaction is generally determined by the nature of the need 
or problem, for while there is a recognized need for specialists trained to cope 
with the more complex and extreme manifestations of human behaviour, many 
personal needs can be and are met through informal, nonprofessional interpersonal 
interactions with nonspecialist lay personnel. Carkhuff (1971) suggests further 
that many helping relationships involve helpees who are functioning normally by 
societal standards, but who seek advice or help on everyday, commonplace 
problems to increase or improve personal effectiveness and/or satisfaction. 

However, from an empirically based standpoint, Kolb and Boyatzis (1974) 
conclude: 

The model of the help is unfortunately not a precise set of mathematical 
interrelationships among operationally defined variables, but rather is a preliminary 
attempt to translate case observations and findings from studies of helping 
relationships in education, welfare, assistance and therapy programs into a single 
theoretical framework which will eventually allow operational definitions of variables 
and tests of interrelationships. (p. 371) 

Carkhuff and Berenson (1967, p. 174) add that persons in need of help are 
often unable to find facilitative helpers appropriate to their level of functioning 
in their immediate environments. In applying this same concern to the potential 
effectiveness of supervisory assistance to teachers, Blumberg (1974) considers 
that: 


However, there is a possibility that supervisors’ insights into and skills to deal with 
interpersonal relations are inadequate for a helping relationship; that, if available at 
all, the training provided for supervisors by school systems is inadequate; or that 
supervisors are selected without regard to the behavioral demands of the job. More 
than likely, it is a combination of all three. (p. 110) 

The question of effectiveness is dealt with further by Carkhuff and Berenson 
(1967, p. 4) who indicate that there is an extensive body of evidence which 
suggests that helping relationships can be either facilitative or retarding, and can 
have constructive or deteriorative effects on intellectual as well as psychological 
indexes. These facilitative or retarding effects can be accounted for by a core of 
primary dimensions which are shared by all interactive human processes, 
independent of theoretical orientation. These core dimensions are empathic 
understanding, positive regard, genuineness, concreteness or specificity of 
expression, and self-evaluation or self-exploration. Similarly, Rogers (1961, p. 44) 
noted that the helpee’s perception of the helper’s attitudes made a significant 
difference to the effectiveness of the relationship. The degree of trust the helpee 
feels towards the potential helper will determine the extent to which the helpee 
is prepared to divulge or share his problem or need with the helper. 
Nevertheless, the teacher’s dilemma is that his needs for help conflict with his 
needs to be seen as competent: “if I convey my needs for help and feelings of not 
knowing what to do, it is likely that I will be seen as a weak and marginal 
teacher” (Blumberg, 1974, p. 144). Formal aspects of roles must not be permitted 
to interfere with the establishment of a genuine, concrete, empathic, and 
respectful helping relationship. 

The role of the consultant is taken further by Dinkmeyer and Carlson (1973, 
p. 85) who propose that “The consultant’s role is perhaps best characterized as 
the architect of change.” They later defined the consultant’s role as a facilitator 
of human potential. The extent to which a consultant is effective in reaching 
such goals is seen by Blocher (1975, pp. 159-160) as being dependent upon at 
least six factors: 


1. The method of intervention; 
2. The characteristics of the consultant; 
3. The characteristics of the consultee; 
4. The nature of the concern; 
5. The nature of the behaviour change desired; and 
6. The nature of the context in which the consultation takes place. 


All six of these factors can readily be translated into educational terms. For 
example, the method of intervention could relate to whether a request for 
assistance is initiated by a teacher or whether the supervisor identifies a need 
and without being approached offers opinions concerning desirable changes in 
behaviour. Personal qualifications, grade level, teaching subject, attainment of 
competence and perceived power of the consultant to affect a teacher’s career 
relate to one or more of Blocher’s six factors. 


Methodologies 

A preliminary study conducted by Plamondon (1973) obtained data from 135 
teachers in five schools in one school district in Alberta. These teachers, whose 
responsibilities covered the K-12 range, represented 97 percent of the 141 who 
received a questionnaire. Plamondon also interviewed four teachers from each of 
the five schools. The four concerns mentioned most commonly by the 
respondents were “Obtaining information on background of students,” “Assessing 
the need for remedial programs,” “Implementing remedial programs,” and 
“Developing course outlines.” For the general area of “Curriculum and program 
development concerns,” respondents favoured department heads as their first 
consultative choice. For advice in the “Instructional materials” area, other 
teachers were most frequently mentioned as first choice, while guidance 
personnel were favoured for assistance with concerns in the “Student 
measurement and evaluation” and “Meeting specific students needs” areas. 
However, of more importance than Plamondon’s actual results concerning 
consultative needs and practices was the confirmation that this was a useful area 
of research which required detailed study. The findings also led to the decision 
to investigate the consultative needs and practices of teachers in schools 
categorized by grade level. Consequently, Plamondon’s research was an important 
stimulus for the three major studies described in detail in this article. 


Table 1 presents an overview of these three studies. While an obvious 
similarity exists among their methodologies and instruments, the approach taken 
by each researcher in relation to the specific detail of data collection was 
substantially individual, and was predetermined to some extent by the conceptual 
focus adopted and by experiences of the previous researchers. These subsequent 
studies sought information from teachers in each of the major grade-level 
subdivisions. Haughey (1976) collected data from all 80 elementary school 
teachers (kindergarten-Grade 6) in three schools within the same school district 
examined by Plamondon. Harrison (1978) obtained information from 209 junior 
high school teachers (Grades 7-9) out of 358 approached in 16 schools in five 
Alberta school districts. Millikan (1979) received 112 teachers’ usable replies out 
of 260 approached in three senior high schools (Grades 10-12) within one large 
school district. Follow-up interviews were conducted with all 80 teachers in the 
elementary school study, and with 18 teachers in the senior high study. Both the 
junior high and senior high studies also obtained questionnaire data from central 
office consultants employed by the school systems; some of these consultants 
working with senior high teachers were interviewed. Choice of the numbers of 
school systems, schools, teachers, and consultants was determined by such factors 
as willingness of school systems and schools to be involved, travel distances, and 
costs. Differences in the methodologies used in the studies were largely accounted 
for by these aspects, but also by insights gained from conducting each study. 
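
The usable response rates implied by these counts can be checked with a few lines of arithmetic. The following is a minimal sketch only: the counts are those stated in the text above, while the dictionary layout and the function name `response_rate` are illustrative assumptions and not part of any of the original studies.

```python
# Usable returns and questionnaires distributed, as reported in the text.
# The labels and data layout are illustrative assumptions.
studies = {
    "Plamondon (1973), K-12": (135, 141),
    "Haughey (1976), elementary": (80, 80),
    "Harrison (1978), junior high": (209, 358),
    "Millikan (1979), senior high": (112, 260),
}


def response_rate(returned, distributed):
    """Return the usable response rate as a percentage."""
    return 100.0 * returned / distributed


# Compute the rate for each study from its (returned, distributed) pair.
rates = {name: response_rate(r, d) for name, (r, d) in studies.items()}

for name, rate in rates.items():
    print(f"{name}: {rate:.1f}%")
```

Rounded to whole numbers, the rates computed for the three major studies agree with the figures of 100, 58, and 43 percent given in Table 1. For the preliminary study, 135 of 141 works out to about 96 percent against the reported 97 percent; the source of this small difference cannot be determined from the article itself.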


While all three studies were directly related in subject matter, comparison of 
the findings was restricted by significant differences in instrumentation. For 
example, “educational concerns” varied in number, general focus and specific 
wording, not all studies sought equivalent information on related issues, and 
different emphases were placed upon interviewing. 
TABLE 1 

OVERVIEW OF RESEARCH METHODOLOGIES EMPLOYED IN THE STUDIES 


                  K-12                     Elementary 
Aspect            (Plamondon, 1973)        (Haughey, 1976) 

Population        5 selected schools       3 selected schools in same 
studied           in one district          school district as provided 
                                           data for Plamondon 

Teacher           3 sections               3 sections 
questionnaire     - demographic            - demographic 
                  - 28 concerns            - 21 concerns 
                  - 7 general issues       - social conflict 
                                             relationships 
                  141 distributed          80 distributed 
                  97% returned             100% returned 
                  (usable)                 (usable) 

Teacher           20 interviewed           80 interviewed 
interviews 

Consultant        nil                      nil 
questionnaire 

Consultant        nil                      nil 
interviews 


                  Junior High              Senior High 
Aspect            (Harrison, 1978)         (Millikan, 1979) 

Population        16 schools: 5 selected   3 selected schools 
studied           at random in one         in one school 
                  school district, all 4   district 
                  in another district, 
                  all 4 in one county, 
                  and all 3 in another 
                  county 

Teacher           6 sections               7 sections 
questionnaire     - demographic            - demographic 
                  - 38 concerns            - 39 concerns 
                  - 3 prime concerns       - 3 prime concerns 
                  - satisfaction           - satisfaction 
                  - provision to           - provision to 
                    colleagues               colleagues 
                  - unsought needed        - unsought needed 
                    assistance               assistance 
                                           - 9 general issues 
                  358 distributed          260 distributed 
                  58% returned             43% returned 
                  (usable)                 (usable) 

Teacher           nil                      18 interviewed 
interviews 

Consultant        66 distributed           31 distributed 
questionnaire     73% returned             65% returned 
                  (usable)                 (usable) 

Consultant        nil                      7 interviewed 
interviews 


Personnel within the school (such as principal, counselor, and teacher 
colleague) who provided such assistance were classified as internal consultants, 
while those from external agencies (such as universities or central or regional 
offices of education) were described as external consultants. Harrison and 
Millikan obtained information from both internal and external consultants, but 
Haughey approached internal personnel only. 


The three studies explicitly defined educational consultation as the 
teacher-initiated seeking of assistance and/or advice about education-related 
concerns. All studies requested teacher respondents to nominate from lists of 
educational concerns those (1) for which they had either sought assistance and/or 
advice or (2) for which assistance either was or might be desired during the 
current school year. 


Results 


Major Consultative Needs 

The six items with the highest percentage frequency response of teachers are 
presented in Table 2. For the group of 80 elementary teachers, “Diagnosing 
learning difficulties” was clearly the major concern, being mentioned by 85 
percent. A related function, “Developing remedial programs” was the second most 
commonly mentioned concern (75 percent), followed by “Selecting instructional 
materials” (73 percent). Of the six highest response items selected by the 
elementary teachers, two related to the general Counselling/Student Services 
category, two to Curriculum/Program, one to Instruction/Methodology, and one 
to Audiovisual. 


At the junior high level, three of the six highest response items related to 
Counselling/Student Services, and one occurred in each of the 
Instruction/Methodology, Curriculum/Program, and Audiovisual categories. The 
two most frequently mentioned items were “Obtaining student background 
information” (76 percent) and “Developing course outlines” (74 percent). Two of 
the six major senior high concerns were in the Curriculum/Program category, two 
were in Counselling/Student Services and two in Instruction/Methodology. The 
most frequently mentioned senior high concern was “Dealing with tardiness 
and/or absenteeism” which was identified by 83 percent of respondents. However, 
this concern was not included on the list presented to the elementary and junior 
high teachers. No reference to remediation occurred among the six highest 
response items in the junior high and senior high results. Two items appeared in 
the nine most frequently mentioned concerns for all three grade levels: 
“Developing course outlines” (55 percent elementary, 74 percent junior high, and 
70 percent senior high) and “Selecting instructional materials” (73 percent, 68 
percent and 73 percent). 

TABLE 2 

PERCENTAGE FREQUENCY DISTRIBUTIONS 
OF TEACHERS’ NEEDS FOR CONSULTATIVE ASSISTANCE 
FOR THE SIX HIGHEST RESPONSE ITEMS 


Elementary (N=80)                                         % 
  Diagnosing learning difficulties                        85a 
  Developing remedial programs                            78 
  Selecting instructional materials                       73 
  Operating audiovisual equipment                         64 
  Developing course outlines                              55 
  Obtaining student background information                54 

Junior High (N=209)                                       % 
  Obtaining student background information                76 
  Developing course outlines                              74 
  Selecting long-use instructional materials              68 
  Diagnosing learning difficulties                        66 
  Selecting audiovisual materials                         63 
  Diagnosing individual differences                       61 

Senior High (N=112)                                       % 
  Dealing with tardiness and/or absenteeism               83 
  Planning and/or utilizing evaluation procedures         78 
  Determining expectations for student achievement        [illegible] 
  Determining established school programs/standards       77 
  Dealing with student personal problems                  [illegible] 
  Selecting “best” instructional materials                73 

a This means that 85 percent of elementary school teachers indicated their need for consultative 
assistance in the task area “Diagnosing learning difficulties.” 


Frequency of Need 


The frequency with which teachers sought consultative assistance was 
reported in only the elementary and senior high studies. In the elementary school 
study, teachers were asked to recall the actual number of their in-school 
interactions in the September-February period; attention was focussed on the 
interaction patterns within the three schools. The senior high study sought 
approximations of the number of occasions of seeking assistance, using the 
classifications “one to four occasions,” “five or more occasions,” and “recurring”; 
the recurring classification applied to requests for assistance which occurred 
frequently and fairly regularly. The majority of assistance requested by senior 
high school teachers in relation to all 39 listed concerns was on “one to four 
occasions” over the current school year. 

TABLE 3 

PERCENTAGE FREQUENCY DISTRIBUTIONS 
OF THE EIGHT MOST FREQUENTLY RECURRING CONCERNS 


Elementary (N=80)                                         % 
  Developing instructional materials                      92a 
  Developing questioning techniques                       91 
  Evaluating lessons/units                                [illegible] 
  Teaching concepts, constructs, generalizations          89 
  Developing audiovisual materials                        89 
  Developing remedial programs                            89 
  Developing unit/lesson plans                            87 
  Developing/using small group activities                 86 

Junior High (N=209)                                       % 
  Obtaining student background information                87 
  Motivating students                                     85 
  Diagnosing individual differences                       84 
  Selecting instructional materials                       84 
  Planning behavioral objectives                          81 
  Selecting performance goals/objectives                  80 
  Diagnosing learning difficulties                        80 
  Solving teacher-teacher conflict                        80 

Senior High (N=112)                                       % 
  Determining expectations for student achievement        73 
  Planning/utilizing evaluation procedures                [illegible] 
  Selecting “best” instructional materials                70 
  Dealing with student personal problems                  70 
  Improving student motivation                            65 
  Dealing with tardiness/absenteeism                      59 
  Obtaining information on budgeting/money control        53 
  Determining course objectives                           53 

a This means that 92% of elementary teachers experienced a recurring need for consultative 
assistance for the task area “Developing instructional materials.” 


Information on the recurring concerns of teachers for assistance with 
particular problems was obtained in all three studies. Table 3 shows data related 
to the eight most frequently mentioned recurring concerns for each group of 
teachers. The most frequently mentioned concerns were “Developing instructional 
materials” (identified by 92 percent of elementary teachers), “Obtaining student 
background information” (87 percent of junior high teachers), and “Determining 
expectations for student achievement” (73 percent of senior high teachers). 
However, many other concerns had percentage frequencies of response very close 
to these maxima. Concerns related to instructional materials were the third most 
frequently mentioned by senior high school teachers (70 percent) and fourth 
most frequently mentioned by junior high school teachers (84 percent). 
“Obtaining student background information,” the most common recurring concern 
of junior high teachers, was not listed among the eight most common recurring 
concerns of elementary or senior high school teachers. “Determining expectations 
for student achievement,” the most common senior high recurring concern, was 
also prominent on the junior high list, where it was called “Selecting performance 
goals/objectives” and mentioned by 80 percent. Diagnosis, evaluation, and 
motivation were also frequently identified as recurring concerns for which 
consultative assistance was sought. 


Consultative Personnel 

The consultant classification most frequently mentioned as the source of 
needed assistance in all three studies was “teacher colleague in the same school,” 
as shown in Table 4. However, some differences in the data collection and 
reporting procedures for the various studies need explanation. In the junior high 
study, all in-school personnel, excluding the principal, were grouped under the 
global classification of teacher colleague, whereas in the other two studies, 
in-school personnel were differentiated according to formal designations. Further, 
no percentage frequency distributions of the provision of assistance could be 
calculated for the elementary school study because of incomplete specific detail. 


However, given the above restrictions on direct comparisons of consultative 
personnel, the following relationships were apparent. When all senior high 
in-school personnel (excluding the principal) were combined, the significance of 
the teacher colleagues as the prime source of consultative assistance in both the 
junior high and senior high studies was apparent, representing 61 percent and 83 
percent, respectively. When the assistance provided by the principal was 
included, in-school consultative personnel accounted for 80 percent of total 
assistance provided to the junior high teachers and 93 percent to the senior high 
teachers. No use of external consultative personnel was reported by the 
elementary teachers. 


Related Issues 


Satisfaction. The three studies sought data from teachers about their ratings 
of satisfaction experienced in relation to the consultative assistance provided. 
The same classifications were used: Very Satisfied, Satisfied, and Not Satisfied or 
Unsatisfied. 


TABLE 4 

PERCENTAGE FREQUENCY DISTRIBUTIONS 
OF THE PERSONNEL MOST FREQUENTLY CONSULTED 


Elementary (N=80)a 
  Teacher colleagues (same school) 
  Guidance counsellor 
  Assistant principal 
  Principal 
  Remedial reading teacher 
  Individual grade teachers 
  Librarian 
  Teacher’s aide 

Junior High (N=209)                                       % 
  Teacher colleagues (same school)                        61b 
  Principal                                               19 
  Teachers (other schools)                                10 
  School district consultants                              8 
  Provincial Department of Education personnel             1 
  Total                                                  100 

Senior High (N=112)                                       % 
  Teacher colleagues (same school)                        31 
  Department heads                                        [illegible] 
  Assistant principals                                    15 
  Principal                                               10 
  Counsellors                                              9 
  [illegible]                                              4 
  School district consultants                             [illegible] 
  Teachers (other schools)                                [illegible] 
  Provincial Department of Education personnel             1 
  Community personnel                                      1 
  Total                                                  100 

a No percentage frequencies were obtained for the consultative personnel in the elementary school 
study. The rank ordering is based on data collected in the study (Haughey, 1976). 

b This means that 61 percent of all mentions of assistance sought by teachers was provided by 
teachers’ colleagues in the same school. 


Over 50 percent of the elementary teachers were Very Satisfied with the 
assistance that they received with respect to these concerns: “Using 
problem-solving/inquiry/discovery method” (60 percent), “Selecting instructional 
materials” (56 percent), and “Obtaining student background information” (51 
percent). On no concern were over 50 percent of either the junior high or senior 
high teachers Very Satisfied with the provided assistance. When the Very 
Satisfied and Satisfied categories of elementary teachers’ responses were 
combined, the following concerns received over 90 percent: “Operating 
audiovisual equipment” (100 percent); “Selecting instructional materials,” 
“Obtaining student background information” and “Evaluating lessons/units” (all 
97 percent); “Developing course outlines” (96 percent); and “Solving teacher-pupil 
problems” (91 percent). The highest percentages of elementary teachers 
unsatisfied with the assistance provided occurred with “Interpreting curriculum 
guides” (33 percent), “Using problem-solving/inquiry/discovery method” (25 
percent), and “Planning individualized instruction” (16 percent). 

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For junior high teachers, high levels of satisfaction were again obtained for 
the assistance with several concerns. Some similarities were obtained for both the 
elementary and junior high levels, e.g., again over 90 percent of the junior high 
teachers were Very Satisfied or Satisfied with “Obtaining student background 
information” and “Selecting instructional materials.” This level of satisfaction was 
also obtained for “Diagnosing individual differences” and “Establishing classroom 
control.” Senior high teachers were also Very Satisfied or Satisfied with 
assistance provided in “Selecting ‘best’ instructional materials” (100 percent). 
Over 90 percent of senior high teachers were Satisfied with assistance given for 
“Determining ‘best’ techniques for content presentation,” “Developing course
outlines,” and “Obtaining information on budgeting/money control.” 


The two concerns receiving the highest Unsatisfied responses from junior high 
teachers related to remedial activities, with 50 percent recording this response for 
“Creating useful remedial materials” and 25 percent for “Diagnosing learning 
difficulties.” Also receiving high Unsatisfied percentages were “Selecting long-use 
instructional materials” (24 percent), “Developing course outlines” (21 percent) 
and “Developing long-use instructional materials” (20 percent). In contrast, the 
senior high items receiving the greatest proportion of Unsatisfied mentions were 
largely psychologically based, being “Dealing with tardiness and/or absenteeism” 
and “Improving student motivation” (both 37 percent). The next most common 
senior high Unsatisfied responses were obtained for “Determining expectations 
for student achievement” (27 percent), “Determining teaching 
time/subject-allocation” (25 percent), and “Determining established school 
program/standards” (18 percent). 

Unsought, needed assistance. Teachers in the three studies were asked to 
give their reasons for not seeking assistance when they felt that they needed it. 
Responses related to “lack of time to seek it” were first ranked in two of the 
studies, and second ranked in one. Senior high school teachers differentiated 
between the personal reason of “no time to seek it,” and the organizational 
reason of “no time is provided to seek it.” The elementary teachers’ first ranked 
reason was “no one to ask or didn’t know whom to ask.” 


Responses relating to time represented 12 percent, 18 percent, and 32 percent 
of the total reasons given by elementary, junior high, and senior high teachers 
respectively. Other reasons appropriate to each of the studies appear in Table 5.


TABLE 5


REASONS GIVEN MOST COMMONLY BY TEACHERS
FOR NOT SEEKING CONSULTATIVE ASSISTANCE


Elementary (N=80)       f             Junior High (N=209)         f             Senior High (N=112)         f

No one to ask or        18 a          Insufficient time           15            Not enough time             [illegible]
  didn’t know
No time                 12            Didn’t know where to        9             No time set aside           5
                                        find information                          for consultation
Not high priority/      12            Information not             [illegible]   Colleagues/administrators   4
  major concern                         practical/vague                           too busy
No wish to take up      11            Lack of availability        9             External consultants not    4
  teacher’s time                        of personnel                              available when needed
Personal desire         [illegible]   Previous experience         8             Never see external          3
                                        was negative                              consultants
No one else teaching    9             Problem eventually          8             Teachers know more than     3
  program                               solved itself                             consultants
Workshop provided       [illegible]   Availability of personnel   5
  answers                               and time to consult were
                                        incongruent
Discussed generally     4             Red tape                    4


a 18 elementary school teachers stated a reason for not seeking desired consultative assistance as
being “No one to ask, or didn’t know ‘whom to ask’.”


Provision of assistance to colleagues. Data related to teachers’ provision of 
consultative assistance to their colleagues were obtained for each grade level. All 
three studies reported provision of assistance to same-school colleagues, while 
Harrison and Millikan also reported provision of assistance to other-school 
colleagues. 

An average of 36 percent of elementary school teachers across the three 
schools in the study were named by others as having provided consultative 
assistance to their same-school colleagues. However, 47 percent of these same 
teachers reported having personally provided assistance to colleagues in the same 
school, while 41 percent had done so for colleagues in other schools. 

Sixty-two percent of elementary school teachers reported having provided 
such assistance “at least twice per week,” while the majority of senior high school 
teachers reported having provided help to same-school peers at least “once or 
twice per year.” No frequency of such provision was reported in the junior high
study.


Concluding Comments


Detailed comparisons of the findings of the reviewed studies were somewhat 
restricted because of differences in instrumentation. Further, while similar data 
were collected, in certain instances the methods of reporting were different, as 
exemplified in Table 4. Nevertheless, the following generalizations are supported 
by the findings. 

1. A majority of the teacher respondents expressed need for consultative 
assistance for a majority of the educational concerns listed on the questionnaires. 


2. The staff classification most frequently consulted by teachers at all grade 
levels was “teacher colleagues in the same school.” 


3. At least half of the listed items were classified by a majority of teacher 
respondents in schools of the three different grade levels as representing 
recurring concerns. 


4. Teachers in schools of the three different grade levels reported overall 
satisfaction with the assistance provided, although significant levels of 
dissatisfaction were reported in relation to different categories of items for 
teachers of different grade levels. 


5. The predominant overall reason why teachers may choose not to seek 
needed consultative assistance related to a lack of time. 


Several differences were noted in the consultative needs and practices of 
teachers in schools of different grade levels. For example, elementary teachers 
identified the need for assistance with diagnosis of learning difficulties more 
frequently than did junior or senior high school teachers, while the reverse 
applied to developing course outlines. At the senior high school level, teachers 
identified their need for assistance with student problems and 
curriculum/program matters to a greater extent than did teachers in schools of 
the other grade levels. With respect to recurring concerns, the elementary 
teachers emphasized instruction and methodology, the junior high teachers 
identified concerns of a psychological nature, and the senior high teachers, while 
also mentioning psychological concerns, stressed expectations and evaluation. 
Smaller percentages of teachers had recurring concerns as grade level increased. 
Elementary teachers saw themselves as providing assistance to their peers more 
frequently than did the senior high teachers. 


Apart from the generation of a great deal of new information, the major 
contribution of these studies to our understanding of the consultative process in 
education lies in the support given to the importance of peer consultation as 
proposed by Parsons, Blumberg, Alfonso, Sergiovanni, and Lippitt and Lippitt. 
Another important contribution was the empirical support given to the notion 
that teachers in schools of different grade levels have consultative needs which 
show some similarities but also several marked differences which require 
alternative methods of resolution.


Recommendations for Practice


The following list synthesizes the researchers’ recommendations based on the 
questionnaire and interview responses for improvement of the practice of 
educational consultation and the consequent reduction and/or alleviation of 
teachers’ educational concerns. 


1. Greater and more effective use should be made of in-school experts by 
providing them with more non-teaching time for consultation. 


2. More intensive in-school orientation seminars should be conducted by 
administrators. 


3. A more diversified network of specialist and generalist consultants should
be established using both full-time and part-time personnel based both in schools 
and in central/area offices. System-level consultation should emphasize specialist 
services such as diagnosis of learning difficulties, while in-school consultation 
should relate more to generalist advice (Haughey, Holdaway & Small, 1977, p. 
95). Because of the more specialized nature of the training and experience of 
teachers in the larger senior high schools, these schools could become more 
self-sufficient in the provision of consultative assistance for a variety of concerns. 


4. The consultant’s role should be redefined to include less administrative 
responsibility. 

Findings from the three studies clearly show the existence of teachers’ needs 
for consultative assistance over a comprehensive range of concerns. The potential 
benefits of educational consultation for both professional growth of teachers and 
alleviation of their concerns appear to have been underestimated, and the 
practice underutilized. At a time when supervisory staff numbers are being 
reduced in many school systems because of declines in student enrolment, we 
need to know more about the practice and benefits of educational consultation. 


Recommendations for Further Research 

The studies reviewed in this article presented findings from rather small 
samples of teachers employed in school districts in the general area of 
Edmonton. While many of the findings are important and were to some extent 
predictable, other studies are required in order to obtain conclusions which are 
more generally applicable. 

Two different approaches could be taken in further research activities. First, 
studies using identical instruments in more geographic areas in Alberta and 
elsewhere are needed to obtain more data about consultative practices for a 
much larger number of teachers and consultants. Second, in-depth site studies of 
the interaction of teachers and those they consult should be conducted. For 
example, a researcher could with benefit examine these interactions in one school 
over a period of one academic year. Some of the theoretical perspectives cited 
above concerning the helping relationship, such as satisfaction of needs, should 
be included in future studies. Also, more data are required concerning practices 
in different settings, that is in classrooms, schools, and school systems of 
different type, size, and location. Such information could be of value to those 
involved in designing preservice and in-service teacher education programs.


CURRICULUM POLICY MAKING IN ALBERTA EDUCATION. By
Max van Manen and Lorne Stewart (Eds.). Edmonton: Faculty of 
Education, The University of Alberta, 1978, 299 pp. & Appendix. 


Public policy is ultimately grounded in an appreciative setting which, like the 
unconscious mind of the psychologist, is mysterious but nonetheless real—an 
often inchoate, subliminal, and infinitely complex matrix of values, motives, 
interests, attitudes, and belief. Those who would change policy must seek to 
understand this setting and the book reviewed here is of great interest as a case 
study of an attempt to alter the appreciative setting of public education in the 
Canadian province of Alberta. 

The book is a response by leaders of the educational profession to a 
ministerial position paper (the Harder Report) proposing a radical restructuring 
of the Alberta curriculum. An educational debate is triggered. Professors Berk, 
Pearson, and MacDonald attack the assumptions, rationality, and logic of the 
report. The historical, religious, and socioeconomic aspects of curriculum policy
are instructively analyzed by Professors Carney, Hodysh, Quarter, and Mazurek.
The pupil as focus of educational policy receives consideration through articles 
by Professors Schmidt, Therrien, and Mitchell. The tortured issues of 
accountability and evaluation are deftly treated by Professors Weiss, MacKay, 
MacDonald, and Nyberg while, finally, the role of the teacher practitioner 
receives careful consideration from Professors Brauer and Stewart. Taken 
together, some 300 pages of illuminating and cogently reasoned responses by 
professional experts all directed to the end of responding to a political initiative, 
and in some way altering the appreciative setting which underlies the educational 
reality. 

The classical strategies for the making of policy are really quite limited. At 
one extreme there is rational comprehensive planning which attempts to apply 
system analysis and theory in a logically rigorous manner and to maximize 
utilities according to the best canons of econometric science. It is strong on 
quantitative methodology; it seeks to allow for all contingencies; and it implies a 
clinically sterile separation of values from facts and ends from means. The 
Harder Report, despite its pretensions to a “systems approach,” does not fit this 
mold. 

At the other extreme there is what in plain talk is called “muddling through,” 
technically understood as “disjointed incrementalism” (Braybrooke & Lindblom, 
1963). That is, the making of small decisions at the margin of organizational 
developments which intertwine fact and value, which do not commit a system to 
large or irreversible changes, and which tend to perpetuate the status quo. The 
Harder Report does not fit here either; its suggested changes, in scope and 
intent, are too radical. 


Between these extreme modes of policy strategy we have the mixed scanning 
of Etzioni (1968) and the neomobilism of Stufflebeam and his colleagues (1971). 
But both of these middle ground strategies demand, at the least, certain 
requirements of tactical heuristics, conceptualization, and structured inquiry 
which Dr. Harder’s report does not seem to satisfy. So again it does not fit. And 
one is left with the question whether this is an exercise in policy-making at all, 
despite the assertion of the title. What is going on here? Is it simply the political 
penchant for the launching of trial balloons? In which case the concentration of 
flak from the vested interest of academe is devastating—there is nothing left of 
the balloon and its gas is thoroughly dissipated throughout the atmosphere. Or is 
it the more cynical political exercise of drawing fire in preparation for an already 
planned advance? These and other speculations are raised in the pages of the 
volume, but it does not seem particularly fruitful to linger over them. The thing 
of interest, the thing of practical value, seems to lie in the serendipity of the 
occasion for we have here, by happy accident, an almost-textbook of the 
conventional wisdom on curriculum thinking in contemporary Canada. Dr. 
Harder of Alberta Education, acting from whatever motivations, has produced a 
discussion paper for the Curriculum Policies Board of Alberta Education. It is a
second draft (the first, he says, having been “generally well received by the
groups to whom it was presented”) and it purports, succinctly, to respond to 
“general dissatisfaction of the public with what they feel are short falls in the 
education system and the high costs of what they term mediocrity.” (Alberta 
Education and Diploma Requirements, p. v). This it does by a truncated
progression from the general (philosophy of education) to the specific (number of 
minutes of various required components in the curriculum). 


Let us be duly considerate. Let us accept for the moment the presumptions 
of public dissatisfaction in Alberta. Then this document and the 300 pages of 
variegated response that it generated is of interest because it presents us with a 
sort of syndrome for frustration. Frustration of the public, of the bureaucracy, of 
the vested professional interests, of curriculum, of policy, of policy-making, of
Alberta, of education, indeed of Curriculum Policy Making in Alberta Education. 
And perhaps of education generally in the developed world. Briefly the syndrome 
runs like this. 

There is a necessarily inchoate feeling of dissatisfaction diffused amongst
sectors of the public education system. The roots may be historical, cultural, 
irrational, resistant to analysis. They may be based on facts or chimera; certainly 
they will be affective and valuational. Some of this diversified and poorly 
articulated sentiment manifests at the various levels of the political system. 
Because of its poor articulation, its ambivalences and ambiguities, it lends itself 
well to political manipulation. It enters into the language games which are played 
in political-governmental-bureaucratic settings. The dissatisfaction then at some 
point becomes delicately or firmly crystallized (a Harder discussion paper; a 
Hall-Dennis Report); it achieves tentative expression. In doing so it permits, for 
the first time in the syndrome, organized reaction about a focus. First to respond 
are those with presumptive expertise. Concomitant with them respond the vested 
interests of the affected profession or professions. Typically there is a barrage of 
rhetoric, argument, analysis, and counter-analysis which tends simultaneously to 
overwhelm and undermine the first articulation which served as a trigger. All of 
this takes time. Energy is spent. Politicians and public may feel that action is 
occurring and work (in the sense of dissipated energy) is being done. The target 
system (schools + school bureaucracy) is tinkered with or tuned. More time 
elapses. Typically no radical or disruptive change occurs. The equilibrium state 
which generated the first phase of the syndrome is restored (if it was ever 
disturbed) and the sequence of events can be considered at an end until the 
historical play of public sentiment on the one hand and political sensors on the 
other conjoin to initiate another round of action and reaction. The waves wash 
upon the shore with greater or lesser turbulence and what is left behind is 
always a flotsam and jetsam of paper. 


Yet there is more to policy-making about the curriculum of education than 
the simplistic syndrome suggested above. The cut and thrust has intended and 
unintended consequences. The syndrome itself is purposive. As Sir Geoffrey
Vickers (1965) has shown, the whole policy-making process is in effect an effort to
alter an appreciative setting. The appreciative setting is the complex of values, 
attitudes, beliefs, and motivations of the actors in a given scenario. In any 
comprehensive sense, of course, this complexity is not amenable to rational 
analysis and manipulation but in practical policy contexts it can be radically 
simplified because of the emergence of strategic, salient, or focusing issues—value 
positions about which the contestants in an arena can organize their arguments. 
This process becomes apparent in authentic democratic election campaigns but is 
pervasive throughout all policy processes. In the instance under consideration 
there is a contest of “appreciation” between the public acting at many removes
through a bureaucratic spokesman and a profession acting at one remove through 
a series of academic spokesmen. The contest is uneven but the interesting 
question is to what extent is the appreciative setting altered after the exercise. I 
cannot answer this on behalf of the Alberta bureaucracy and government. I 
suspect, however, that on the part of the academic establishment the 
appreciation of the situation is now entrenched; certainly this document provides 
the ammunition for such fortification, for all the curricular bases are doubly 
barbed and armoured. 

The pattern is very clear in the instance being considered. At ministerial or 
sub-ministerial behest, a paper is produced. It is responded to in this volume by 
marshalled expertise within a university. The experts include historians, 
professors of administration, editors of learned journals, and authorities in 
elementary education, educational psychology, educational philosophy, and 
evaluation. Needless to say the trigger paper is damned, blasted, and demolished. 
The waves recede and the ship of Alberta Education sails on. Nothing 
exceptional here save that in this one instance, by a fortunate happenstance of 
editorial skill, assemblage of authors, and right institutional patronage, we have 
as a lucky product of this sequence a document which in one set of covers makes 
available for continued critical analysis the conventional wisdom of curriculum 
circa 1978. In a real and useful sense it constitutes a text in curriculum, applied 
educational philosophy, and educational administration. Because of this quality 
of coagulated curriculum thought and inquiry, it is worth drawing attention to 
some of the critical questions raised, points made, and issues vexed. 


First, the paradox inherent in educational policy making. It can be shown 
(van Manen, p. 12, citing Worth) to be a power-based process. The proposed 
remedy is to compensate by enhanced democratic participation. But this 
enhances the irrationality and political quality and initiates a sort of vicious 
circle of which the syndrome described above is a by-product phenomenon. And 
the logical alternative to this, the rational-comprehensive approach, can then be 
shown to be either (a) unworkable or (b) ideologically unacceptable. 

Much of the criticism of the Harder plan is directed against its connotations 
of management control, “payment by results,” accountability, the “systems” 
approach, the “management science” sort of thing. One is led again and again to
wonder whether in education there can be any new thing under the sun. The
sense of déjà vu is overpowering. Callahan (1962) recorded this movement for us
so clearly in his Education and the Cult of Efficiency.
Moreover, as one of the respondents points out, there is something automatically 
distasteful about a sandwich the meat of which is a quantitative specification of 
the number of hours to be spent upon “basic life skills” (which translates into 
little different than what presently obtains) and a series of managerial controls 
(mastery and competence examinations). This meat then being inserted between 
the bread of classical Greek idealism (Panathenaicus) as a fore-quotation and 
some sentiments of the prophet (Kahlil Gibran) as an after-quotation. From the
qualitative to the qualitative via the quantitative without benefit of visible 
logical connection. 


It is not, of course, that professional educators are resistant to accountability, 
or deficient in responsibility, or, worse yet, defensive of a claim on the flow of 
public funds which affords them opportunities for what the labour movement 
calls “featherbedding.” Nor is it, on the other hand, that educators are
necessarily less venal than lawyers, physicians, and bureaucrats. G. B. Shaw was 
close to the mark when he said that “all professions are conspiracies against the 
laity.” But the distinctive peculiarity of the education profession is its intractably
qualitative character. To apply systems reasoning is therefore especially difficult. 
It might result in such tacit or explicit propositions as “The throughput of the 
system is protoplasm.” To be sure it is—and the feedback loops are protoplasmic 
as well, as are the sensors in the political support metasystem which latter are, 
we presume, what Dr. Harder is most concerned about. Which means that the 
business of schools (and this subsumes all the managerial details of their 
curricula) is in the last analysis imponderable, ineffable, not susceptible to 
management by objectives. To ask about the purpose of schooling—even allowing 
Harder’s distinction between schooling and education—is to ask a question like, 
What is the purpose of the State? What is the goal of Canada? Of North 
America? It is not like asking, What is the purpose of the medical profession?, of 
the judiciary?, of the firm? It is to ask something about life chances, about the 
quality of life, about the metavalues of a given society and culture as it moves 
through historical time. And this kind of question does not easily yield answers 
for we are back again into the realm of the inchoate and often the (literally) 
unspeakable. 

Behaviourism is attacked and Harder is accused of this heresy. He is also 
accused, rightly, of illogic, inaccuracy, inconsistency. And educational engineering
(p. 63). Other accusations follow. Pandering to a nonexistent discontent amongst
the clientele: “throwing the public a bone” (p. 17). Really attempting to respond
to a problem of growing unemployment ([illegible]) and [illegible] concern for
“quality control” based on a resurgence of the fallacy of a “science of education” 
(pp. 128, 133). The arguments presented by Professor Quarter are powerful, 
factual, suggestive, and persuasive enough to affect most appreciative settings.
He suggests a trend towards “puritanical functionalism” which would translate
into greater centralized regulation and control (of protoplasmic flow and 
curriculum). Quarter’s historical survey of ideological change in education is very 
valuable and one wonders only whether he has taken into sufficient account the 
demographic aging of the Canadian population. 

The assaults continue and amid the rumble and thunder, the professional 
position becomes clear. The lines of the academic commitment are as follows: To 
schooling as education, that is, the major not the minor role. To an evaluation of 
education which is general rather than specific, that is, not tied merely to
competency based models and quantitative norms (MacDonald, who is British 
and the only double contributor to the polemic, writes convincingly about 
accountability, standards, and highly sophisticated modes of school evaluation). 
To humanism; to child-centredness; to social relevance of the curriculum (e.g., 
the lack of credibility of qualifications in “Practical Arts,” p. 168). To emergent 
as opposed to ante facto goals—since each child has to re-interpret the symbolic 
quality of the culture into which he is born. To a concern with the child’s own 
subjectivity and intentionality (p. 172). To communication viewed as “speaking, 
writing, drawing, singing, dancing, constructing, listening, reading, and viewing.” 
(One wonders whether in that order of priority?) To community involvement and 
a less sterile separation of school from society. To self-actualization—the
all-pervasive Maslowian model once again. To change! Not clear at all this last. 
It never is. 


And so it goes. A symphony of sorts which would not fall harshly upon an 
educator’s ears for it is, after all, an aggregate of themes and Leitmotiven with 
which he is thoroughly familiar; it is indeed the sound of orthodoxy, the 
conventional wisdom ex cathedra, with counterpoint provided by the chorus of 
“back to the basics.” It provides no surprises. It is the old neo-progressivism well 
modulated by conservatism. 


Here then we come to what is perhaps the chief and most useful function of 
such policy making exercises as the Harder discussion paper. It is, consistent 
with Vickers, to modify the appreciative setting. Each time the establishment is 
forced by such initiatives to put down in words its position, it is made to engage 
in self-scrutiny, the clarification and justifications of its values, and the 
modulation of those values in response to the shifting, evolving socio-cultural
context. Given the shifts which have occurred in Alberta, Canada, and North 
America in that context over the last generation (among the most significant of 
which must surely be the media explosion and the pervasion of technology) and 
given the values diffusion and confusion (not to mention dilution) of 
contemporary society, it is remarkable that the dominant philosophy of education 
has endured with such little change and so few surrenders. One speaks here of 
philosophy not so much in any textbook or academic or rhetorical sense but as it 
“cashes out,” as it becomes manifest through the institutions and processes of 
educational administration.

In his concluding remarks Professor Stewart, a co-editor of the book, refers to
this indirectly when he asserts that the curriculum process “implicit in the 
Harder proposals is a bureaucratic procedure exceedingly cumbersome and 
time-consuming which has historically proved ineffective” (p. 296). The 
alternative and the right way to go about curriculum development is, he says, at 
the school level of organization through cooperation between teacher, pupil, and 
parent with “freedom for these people to define objectives, select content, 
organize learning tasks and monitor progress” and freedom from “burdens and 
constraints which interfere with the performance of these tasks” (p. 296, my 
italics). An innocent might ask, Is this not a largesse of freedom? And a less 
innocent might ask, Who are these people? Do these people translate into 
teachers? And teachers translate into vested interests? The answers to all these 
questions would no doubt invoke the term “professional.” Indeed, the concept of 
an education profession is at the back of all that is presented in this book. This 
profession would not necessarily have an inaccessible and esoteric body of 
knowledge confined to its initiates (as is much the case in medicine and less so 
in law) but it would presumably have an acknowledged mastery over the 
problems defined by its rubric. What these problems are in education comes
through clearly in a reading of this book as does the degree and state of mastery. 
What comes through besides is the coherence and consensus of curriculum 
opinions and a sense of the power, latent and manifest, of the educational 
establishment. 


I began by recommending this work as a text for educators in the 
conventional wisdom of curriculum, but must end by recommending it no less as 
a text of desirable reading for all practitioners in the politics of education. Those 
who dabble in the art of the educationally possible may gain thereby while those 
who seek for accountability may seek forever unless they first learn that the 
questions they ask and the answers they create are dialectical, iterative, and 
proximate.


BOOK REVIEWS


EVALUATING THE NEW B.ED. Edited by Gerald Collier. Surrey, England: 
Society for Research into Higher Education Limited, 1978, 120 pp. 


In teaching, as in acting or athletics or industry, there exists no shortage of 
assumptions explaining success. These include liking it, practising it, even having 
a natural talent for it. One latter-day supposition about teaching is that if you 
train for it at a university, you’ll do a better job. Albertans adopted this stance 
back in 1945 and Canadians elsewhere at length fell into line. In Britain, 
however, moving preservice teacher education largely on campus happened more 
recently, following the 1972 James Report and the White Paper, Education: A 
Framework for Expansion, appearing the same year. Several consequences of 
these documents— the assumption that teachers ought to hold degrees, that 
teacher education should be carried out in larger institutions, that training 
courses (we would say programmes) should undergo periodic scrutiny by the 
Council for National Academic Awards—prompted Evaluating the New B.Ed., a 
collection of conference papers competently edited by Gerald Collier of the
University of East Anglia’s Centre for Applied Research in Education. 


You will readily discover that this book is methodologically oriented. Granted, 
Staff Inspector Ian Butterworth and CNAA Officer David Francis offer personal 
impressions of how, despite academic politics, newly-located programmes have 
tried to reconcile school experience with university studies. Admittedly, 
Nottingham’s James Eggleston and Ted Wragg proffer down-to-earth approaches 
to programme evaluation, warning respectively of crude eclecticism and narrow, 
competency-based teacher education calculated merely to prepare novices “for 
schools as they are.” To be sure, Peter Miller of Keswick Hall College and Robin 
Alexander of Didsbury College, Manchester and the University of Leeds reflect 
on problems arising from the practical, political, and psychological sides of 
evaluation, including establishing institutional states of mind receptive to it, and 
realizing as it is carried out that the character of good teaching can be 
unpredictable and “essentially elusive.” Just the same, strategy and methodology 
occupy the larger share of available space. Proponents include Michael Eraut 
from Sussex; Robin Alexander; Roland Hoste of Leeds and Stirling; Clem 
Adelman from Bulmershe College of Education; Birman Nottingham of St. 
Martin’s College, Lancaster; and Dan Wicksteed, Worcester College of Higher 
Education. At a time when accountability in some form or other dogs the 
footsteps of many an education faculty, what they have to say will interest all
engaged in or answerable for teacher education. 


A succinct argument for sound research design, Eraut’s paper demonstrates 
the conflict between intrinsic and extrinsic purpose in evaluation, the one serving 
a research, the other, a political community. It reviews the relative merit of 
preformulated hypotheses compared with more open-ended procedures. Finally, it 
considers evaluation as a normal social process best effected through a 
systematically-elaborated agenda negotiated with client groups as to content, 
purpose, and needed cooperation. Alongside this theoretical piece appears 
Alexander’s lengthier account of setting up an actual internal evaluation at 
Didsbury College. Instigated to improve teaching, the exercise constituted an 
agreed-upon feature of Didsbury’s administrative policy and was carried out by 
staff and students lest evaluation “become a tool of senior management.” In order 
to produce a “cumulative profile,” Didsbury utilized questionnaires, open, written 
responses, interviews, and meetings with school teachers. They also observed 
classes in session, consulted college records, and in one instance invited an 
outside opinion. Hoste’s briefer offering draws attention to typical impediments. 
The internal evaluation proves unaware of blind spots. Staff social relations or 
student anxiety over results contaminate objectivity. Sampling is narrow. By 
contrast, Adelman writes about an independent evaluation of three colleges. His 
team was affiliated neither to staff nor students and his data governed by an 
“ethical framework” intended to protect those interviewed. Two additional essays 
complete this section. One, Nottingham’s “The First Stages of an Evaluation 
Exercise,” stresses the early need to get a feeling for the programme as a whole.
The other, Wicksteed’s “How to Begin an Evaluation Programme,” identifies 
characteristic objections to evaluation including the contentions that we should 
teach, not evaluate, that we in any case regularly evaluate student performance, 
and that curriculum design is a matter of professional judgment, not majority 
opinion, particularly from students! Overcoming this sort of skepticism occupies 
the author for the balance of his summary. 

While preparing Evaluating the New B.Ed., Gerald Collier no doubt 
experienced the usual challenges confronting conference paper editors. For one 
thing, contributions appear uneven as to bulk, content, and style. Quite apart 
from their intentional variance—theoretical as opposed to practical purpose; 
analytical as distinct from descriptive realization—some attain lean objectivity 
whereas others bog down a bit in awkward metaphor or self-conscious 
autobiographical flourishes. I compare here Alexander’s structured composition 
with Francis’ penchant for machine imagery or Nottingham’s fascination over his 
own status as an Anglican clergyman. For another thing, Collier faced the 
inevitable bits and pieces that conference proceedings generate and tucked them 
in as best he could. Hence, discussion in varying length augments some but not 
all papers. Where appropriate, notes and references follow. Some papers are set 
off so as to show clearly their beginning and ending. Others run visually one into 
the next. In a low-budget publication, though, these organizational peccadilloes
are of little consequence. I don’t envy a conference editor his task and concede 
that editorially, this indexed collection reads well enough. 


With regard to theme, I repeat that the chief concern is method. Evaluating 
the New B.Ed. doesn’t really assess changes triggered by the 1972 James Report 
and White Paper, though such a book would be welcome. Rather it furnishes 
impressions, opinions, even models of how, in particular institutions, we might go 
about assessing such changes. Happily for the book’s credibility, contributors 
warn time and again of the probable idiosyncratic nature of evaluations to which 
they refer. Overall, though, the work effectively highlights the difference between 
internal and independent evaluation, the one approach constantly menaced by 
“occupational tunnel-vision” (Scriven’s phrase), the other occasionally vulnerable 
to political misuse. With its chapters anchored in a rich and recent literature of 
curriculum evaluation, its contentions enjoy high transfer potential of exceptional 
value to teacher educators, their wider clientele, and those who legitimize and 
fund their efforts. 


John Calam 
Faculty of Education 
University of British Columbia 


TEACHER PARTICIPATION: A SECOND LOOK. By Robert B. Carson and 
John W. Friesen. Washington, D.C.: University Press of America, 1978, 97 pp. 


Teacher participation in decision making has been an important area of study 
for a number of years. However, problems related to it are still largely 
unresolved. As a consequence, attention to this area is vital in order to develop 
understanding of the dynamics of shared decision making in schools. 


This book reports research on the nature and degree of teacher participation 
in decision making. Its significant contribution, though, rests on the attempt to 
make a longitudinal study of changes in teacher participation in decision making. 
The authors are concerned with the question that is always important, “Who 
influences the choice processes that structure educational opportunities for kids?” 
They are also interested in what happens over a period of time to participatory 
decision making. 

The authors make important contributions both in substance and in 
methodology related to their theme. The impact of their presentation could be 
aided by the three suggestions that appear to be justified from a reading of the 
text. These will be explained briefly. 

The authors uncritically accept certain assumptions that one might think
would need further elaboration and support from theory or other research. For 
instance, they assume that the reader is fully aware of the nature of participation 
in decision making. There is no attempt made to define the complex concept 
which is the subject of the study. Presumably this has been addressed in the 
earlier work on teacher participation. However, the failure to clarify the subject 
of the research presents some difficulties for the reader. 


Another assumption of the authors is that the reader will be able to bridge 
the gap between the literature review and the study itself without being given a 
paradigm or conceptual framework. In fact, the literature review deals with a 
number of studies on the extent of teacher participation in decision making, but 
hardly touches on changes in the desire for participation by teachers over time, 
which happens to be the main thrust of the study. 


One more example of a questionable assumption relates to the inferences 
drawn from the data. The reader has difficulty seeing how the responses on the 
four-point scale can be reduced to a single percentage figure for each item, 
especially when the scale is not a continuum from no involvement to full 
involvement in decision making. For instance, how did the researchers obtain the 
data that 96.9 percent of the teachers “consider formal participation in decision 
making by teacher to be appropriate” for curriculum planning and development? 

The book presents an interesting opportunity for longitudinal studies on 
changes in teacher participation in decision making. Difficulties with this type of 
research are usually almost insurmountable. Hence, the book is a laudable 
achievement. The fact that weaknesses are apparent does not detract from the
significance of the theme and methodology inherent in the book. It can serve as 
a stimulus for further research on trends in participatory decision making in 
schools. 

Placed in proper context, and viewed within the limitations mentioned above, 
the publication points out again the complexity in the normative view of 
participation in decision making. The desire of teachers for more participation 
may have increased slightly during the nine-year period. However, teachers are 
generally “content with participation in matters related to their classrooms.”
Administrative matters and policy decisions still do not call for significant 
teacher input. Nor do teachers generally demand a greater input in those areas. 

Administrators seem to see teachers as formal participants in a number of 
areas of decision making, even though teachers are not seen as having the 
authority to establish policy in any area. 

The last chapter of the book contains valuable information on participation in 
decision making in schools. It could be especially useful for researchers, teachers, 
and teachers’ associations. 


D. Friesen 
Department of Educational Administration 
The University of Alberta 


PIAGETIAN THEORY AND THE HELPING PROFESSIONS. Proceedings of
the 7th Annual Conference, Vol. 1. Edited by Roma Weizmann, Richard Brown,
Philip J. Levinson, Priscilla A. Taylor. Los Angeles: University of Southern 
California, 1978, 432 pp. 


The work under review, Piagetian Theory and Its Implications for the 
Helping Professions, consists of the proceedings of the Seventh Interdisciplinary 
Conference of the same name. The 1977 conference was sponsored by the Schools 
of Social Work and of Education, University of Southern California, and the 
University Affiliated Project, Children’s Hospital of Los Angeles. Emphasis 
during these particular sessions was on the implications held by Piagetian theory 
for social work and psychological services. 

This volume of the proceedings consists of the two keynote presentations, 
delivered by H. W. Maier and P. Mussen, plus 45 submitted papers selected on 
the basis of their relevance to the conference theme. Despite the heterogeneity of 
interests reflected in this book, a majority of papers are focused on cognitive 
development in subjects suffering from some form of impairment. In addition, a 
number of studies are also presented which examine basic issues in Piagetian 
Theory (for example, the concept of knowledge, the assessment of formal 
operational thought, etc.). Of particular interest to this reviewer was the 
comprehensive bibliography of scholarly works on moral development for the 
period 1970-1976, prepared by Clifford Lange, and the paper by Barbara Kane
pertaining to a Piagetian discussion of the development of children’s concepts of 
death. In general, due to the multifaceted and interdisciplinary nature of the 
conference proceedings, the resulting compendium might best be viewed as a 
research journal. 

Although a number of papers will prove interesting to the Piagetian 
researcher, this reviewer restricted himself to the keynote presentations. Maier’s 
paper is unique and intriguing in that he applies Piagetian concepts and 
precepts, specifically those associated with the sensory-motor phase of 
development, to the initial stages of the helping relationship. The theme of the 
paper is best summarized in Maier’s own words: 


In essence I maintain that a whole period of development (more than a year of 
impactful life experiences) precedes language development. Similarly a whole
progression of interpersonal experience at the beginning of the helping continuum 
influences the progression of the helping process. These beginnings always precede a 
more definitive assessment and treatment undertaking. (p. 12) 


Mussen’s paper, dealing with the roots of pro-social behaviour in children, 
also proves useful and stimulating. He provides an excellent review of the current 
research and literature concerning his chosen topic, noting the importance of 
Piaget’s formulations on moral development to this area, but goes on to state 
that “Piaget’s work is of some, but only minor, help.” Mussen then proceeds to 
examine the development of pro-social behaviour from a variety of perspectives 
including (1) moral judgement, (2) socialization, (3) the effects of modelling, (4) 
identification and generosity, (5) studies of adult altruists, (6) parental 
nurturance, (7) the use of power assertion and induction as disciplinary 
techniques, and (8) maturity demands and the assessment of responsibility. 


Mussen concludes, as a result of his study of the area, that scientific findings 
often confirm much of what is professed in folk wisdom. In Mussen’s opinion, 


There are a number of substantial truths about child rearing practices and prosocial 
behavior that can be distilled from the findings of these studies. They are not 
surprising or sensational; in fact, perhaps one of the most interesting general 
conclusions is that techniques that many sensible people have used for millenia are 
conducive to the development of prosocial behavior: modeling prosocial behavior, 
especially by a nurturant model, and identification with a prosocial parent;
induction— that is, reasoning and explanation— rather than power assertion in 
disciplining the child; making reasonable maturity demands. It is comforting to find 
that scientific findings confirm that much of what is advocated in “folk wisdom” 
actually works. (p.27) 


Mussen ends his paper with a note of caution. He emphasizes that we are 
still far from a complete understanding of the problems of pro-social
development and pro-social conduct and stresses the need for a number of 
further investigations, particularly those of a naturalistic nature. He believes this 
approach preserves the complexity of the situation while allowing the investigator 
to examine how the determinant variables of pro-social behaviour interact to 
enhance or inhibit the development of this form of behaviour. 


As was stated earlier, this volume is a compendium of topics, perspectives, 
and research findings based, in varying degrees, on Piagetian theory. Its strength 
lies in the honest attempt being made by the conference organizers and others to 
apply Piaget’s concepts to the helping professions. Conversely, the current state 
of the art constitutes the book’s greatest weakness; although specific papers are 
thought-provoking and informative, the volume lacks systematic organization and 
structure. In the opinion of this reader, the proceedings mirror the diversity of 
opinions and interests centered around the use of Piagetian concepts in applied 
settings and, as such, are worthy of consultation and perusal by those interested 
in this area. 


L. L. Stewin 
Department of Educational Psychology 
The University of Alberta 


PREPARATION OF MANUSCRIPTS


1. All manuscripts must be typewritten, double spaced, and submitted in
duplicate. An abstract of approximately 100 words in length, typed on a
separate page, should be provided.

2. Tables must be numbered in Arabic numerals with the word ‘Table’ centered
and in capital letters, e.g., TABLE 4. The heading of the table is to be centered
below and typed in capitals. The format of tables should conform to the
specifications in the APA Publications Manual.

3. Graphs and charts should be used only if essential. They must be carefully
prepared on separate sheets in India ink, ready for reproduction. Graphs must
be properly labelled using Arabic numerals, e.g., Figure 3.

4. Each table or figure should be presented on a separate page. The position of
tables and graphs should be clearly indicated within the text by inserting at the
relevant point the phrase (Insert Table 2 here).

5. References should appear in parentheses following the reference citing the
author’s name (unless the name appears in the text), the year of publication,
and page number if appropriate. For direct quotations, the reference should be
cited and the page number given in brackets before the final punctuation of the
quotation. The references should be listed alphabetically by authors’ last names
at the end of the manuscript under the heading, References.

6. Explanatory notes, numbered consecutively and identified in the text with a
superscript, may be included under the heading of Notes. They should be
double spaced and placed at the end of the manuscript immediately preceding
the References. The citing of references and quotations in the Notes should
conform to the procedures outlined in No. 5 above.

7. Spelling shall conform to the Oxford English Dictionary or to the Dictionary of
Canadian English, except in the case of direct quotations which must conform
to the original. Editorial alterations will be made if necessary.

8. In matters of style, the APA Publications Manual is considered definitive.


AJER: The First Twenty-five Years


The Alberta Journal of Educational Research has served the 
interests of educational research in Canada and abroad since its 
inception in 1955. In its commitment to the dissemination, criticism, 
interpretation, and encouragement of all forms of systematic enquiry 
into education, the Journal has acted as a vital force in the publication 
of research results of new as well as established scholars in psychology, 
administration, curriculum, sociology, history, and philosophy. 

The current issue of the Journal commemorates this long-standing 
commitment in presenting Professor Naomi Hersom’s perspective on 
educational research and Professor Andrew Clark’s comprehensive 
bibliography of articles published in the AJER during its first 
twenty-five years. 

I wish to extend sincere appreciation to contributors, reviewers, 
subscribers, and those associated with the AJER who over the years
have supported its development.


Henry W. Hodysh 


R. R. O’REILLY
University of Ottawa


and


JOHN W. JARRETT 


Carleton Board of Education 


Student Involvement and the 
Intermediate School 


Among the stated purposes of the middle school is the provision of 
opportunities for the development of student independence and individuality. 
Behaviour setting theory provides a framework for analyzing student behaviour 
in various school settings. The involvement in school activities of grade 8 
pupils from ten K-8 schools was compared with that of pupils from ten
intermediate schools. A student involvement survey instrument was carefully
developed. The fourth edition yielded a test-retest correlation of .94. As an
index of validity,
the correlation between teacher and student ratings of student activity was .70. 
Almost 600 students participated in the study. The results support the 
hypotheses emanating from the theory. Grade 8 pupils in K-8 schools tend to 
be more involved in school activities than do their peers in intermediate 
schools. In this major outcome at least, K-8 schools tend to serve the needs of 
young adolescents better than do intermediate schools. 


The provision of special schools for students in the intermediate grades is not 
a new idea. The junior high school movement can be traced back to the early 
part of the century. The most recent trend in intermediate education is the 
growth in the number of middle schools. In spite of the wide popularity of these 
schools, there has not been much research on the efficacy of this type of 
organization. Even such an eminent authority as Alexander admits that research 
findings are inconclusive. He says, “I wish I could tell you that research has
firmly established the idea that the middle school is a better organization than
any other organization. I can not do it” (1969, p. 3).


Dr. O’Reilly is Professor, Educational Studies Section, Faculty of Education, University of Ottawa. His interests are
management and evaluation of educational and social programs. He is currently on leave to serve as Executive
Director of the National Legal Aid Research Centre.

Dr. Jarrett (Ph.D., educational administration) has been a principal of elementary (kindergarten to grade 8) schools
and intermediate schools. He is now Superintendent of Special Education with the Carleton Board of Education.


Wattenburg (1967) claims that there is no evidence to indicate that pupils 
who go through a system with a separate intermediate school level differ in any 
way from those who go through a two-level system. Chellew, Baker, Baker, 
Dimitri & Johnson (1973), after making a comprehensive review of studies on 
intermediate schools, conclude that the research fails to demonstrate that 
intermediate schools are superior to K-8 schools in terms of either achievement 
or student self-perception. Soares, Soares & Pumerantz (1973) suggest that 
intermediate schools may be less desirable: 

traditional schools are more likely to be neighbourhood schools, while middle schools
draw from many sections of the city. Continuity and security emanating from the
known and the familiar are generally more conducive to higher self-perceptions.
(p. 387)


Similarly, Mead (1965) claims that segregating pre-adolescents by placing them in 
intermediate schools divorces them from the reassurances and the security that 
are inherent in a continuing association with children like their own recent past 
selves. 


Educators are generally agreed on the rationale for intermediate education. 
They state that unless the characteristics and needs of the students underpin the 
intermediate school program, there is probably little justification for establishing 
such schools. Popper (1967) states that “if the human condition of early 
adolescence ceases to be the dominant focus of middle school education, then the 
old pattern of an 8-4 school organization would have a more valid administrative 
rationale” (p. 6). 

The early adolescent’s strivings for independence and emancipation are among
the important special needs of this age group which should be provided for in the 
intermediate grades (Desjarlais & Rackauskas, 1975). Certain aspects of the 
intermediate program, therefore, should foster the development of a sense of 
autonomy and individuality in the students (Schmuck, 1965). 


The aspect of the intermediate education program that appears to lend itself 
best to the development of independence and individuality is the activities 
program (Reubel, 1973). This part of the program, in the opinion of a number of 
writers, has the greatest potential for fostering leadership and social skills in the 
students (Havighurst, 1968) and for developing a sense of community in 
preparation for life in a democratic society (Overly, Kinghorn & Preston, 1972). 


Purpose of the Study 


The question is: Are intermediate schools best able to provide for the special 
needs of early adolescents? Certain writers appear to have doubts about this. The 
level of participation in school activities in intermediate schools is questioned 
and, consequently, the efficacy of these schools is also questioned (Overly et al., 
1972). Chellew et al. (1973) speculate that participation of intermediate grade 
students may be greater in K-8 schools than in intermediate schools. 


The general purpose of this study is to compare the level of participation of 
grade eight students in two types of school environment (intermediate schools
and K-8 schools) in order to investigate the effectiveness of the intermediate
school environment in generating student behaviours that are in keeping with the 
aims of intermediate education. 


Theoretical Rationale 


Behaviour Setting Theory 


The basic postulate of the study is that environment and behaviour are 
directly linked. Behaviour setting theory, upon which this study was developed, 
states that behaviour is largely controlled by the environmental setting in which 
it occurs (Willems, 1974). The theory has been elaborated by Barker (1968) and 
his colleagues. According to the theory that has emerged from these studies a 
setting is a homeostatic system with controls that maintain the setting intact, 
operating at a stable functional level under widely varying conditions (Barker & 
Gump, 1964). This homeostatic system consists of multiple, independent 
elements. For every homeostatic level there is an optimal number of elements. 
When the optimal number of elements is present, the behaviour setting is said to 
be optimally manned. When the number of elements falls below the optimal 
level, that is, when the setting is undermanned, important consequences follow. 
In the undermanned situation, because the homeostatic controls maintain the 
functional level of the setting, the same pattern and strength of forces must be 
distributed among fewer inhabitants. The impact of undermanned settings on the 
individuals who occupy them results in: (1) greater effort in the form of harder
work or longer hours; (2) more difficult and important tasks undertaken; (3) a 
wider variety of activities engaged in; (4) less sensitivity and less evaluation of 
differences between people; (5) lower level of maximal or “top” performance; (6) 
greater functional importance within the setting; (7) more responsibility; (8) 
greater functional self-identity; (9) lower standards and fewer tests for admission;
(10) greater insecurity due to increased dependence upon others; and (11) more 
frequent occurrences of success and failure (Wicker, 1967, p. 6). 
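
The manning argument can be made concrete with a small illustrative calculation. The sketch below is not drawn from the studies cited: the function name and the figures are hypothetical, and it assumes only that a setting has a fixed number of roles to be filled, so that the average load on each inhabitant rises as the number of inhabitants falls.

```python
def roles_per_inhabitant(roles_required: int, inhabitants: int) -> float:
    """Average number of roles each inhabitant must carry (illustrative only)."""
    if inhabitants <= 0:
        raise ValueError("a setting needs at least one inhabitant")
    return roles_required / inhabitants


# Hypothetical setting that needs 12 roles filled (not data from the study).
optimally_manned = roles_per_inhabitant(12, 12)  # one role per person
undermanned = roles_per_inhabitant(12, 4)        # same roles, fewer people
print(optimally_manned, undermanned)
```

On these assumed figures, each inhabitant of the undermanned setting carries three times the load, which is the sense in which the same forces are distributed among fewer inhabitants.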

A behaviour setting can also shape a sense of responsibility or obligation in 
its inhabitants. Willems (1967) states that: 

induced forces, or environmental feedback (a) define the person’s social obligations 
for him and direct him to it and (b) inform him of possible consequences of his 
actions in the form of sanctions and rewards. As a result of this definition and 
direction and potential return, an internalized, subjective disposition, that is, sense 
of obligation is shaped. (p. 1249) 


Behaviour Setting Research

Behaviour setting research that has been conducted in schools has centred
largely on the question of size. Barker and Barker (1964) claim that these studies
have revealed what they call the school size illusion.

We have discovered that small high schools are, in fact, not so small on the inside
as they are on the outside. In terms of number of behavior settings, number of 
inhabitants per setting—interior characteristics not easily seen from the 
outside—small schools differ less from large schools than in terms of number of 
students and amount of space, which are perceptually salient external attributes of 
schools. (pp. 62-63) 

In an investigation that involved a comprehensive behaviour setting survey of
high schools ranging in size from 35 pupils to 2,287 pupils, Barker and Barker 
(1964) discovered that as the schools increase in size, the variety and richness of 
behaviour settings in the schools increases at a much slower rate. They concluded 
that smaller schools manage to sustain a large proportion of the types of 
behaviour settings provided by larger schools. 


Gump and Friesen (1964a) studied student participation in nonclass settings 
or extracurricular activities. They chose to investigate the nonclass setting 
because the most adequate field methods available to them made use of the free 
decisions of students to enter or not to enter particular behaviour settings. They 
felt that it was essential to avoid settings where student participation is 
determined by school officials. They state their findings as follows: 

This study demonstrated that a large school provides a somewhat larger number and 
wider variety of nonclass activities than a small school . . . but the small school 
provides a higher proportion of settings to the number of students; this has the 
following consequences for the student’s participation in activities: (a) Small school 
students participate in the same number of settings commonly regarded as 
extra-curricular as do large school students. (b) Small school students participate in 
a wider variety of extra-curricular activities than do students in a large school. (c) A 
much larger portion of small school students hold positions of responsibility and 
importance. (d) Finally, small school students hold responsible and central positions 
in a wider variety of activities than do students in a large school. (pp. 92-93) 

Another study relating to students’ feelings and experiences in school
behaviour settings was conducted by Gump and Friesen (1964b). Students were 
asked to select from a list of activities that they had entered those which they 
felt were satisfying and worthwhile. These students were also asked to tell what 
they liked about these settings. A content analysis was applied to these responses 
and revealed that the students in the small schools derived a greater range of 
satisfaction from their participation in nonclass activities than did students in 
large schools. The general conclusion that Gump and Friesen came to as a result 
of this research with junior high school students was that: 

large and small school Juniors derived different satisfactions from their nonclass 
settings and that these differences were causally related to differences in the number 
of students who occupied more central, important positions and the number of such 
positions held by each student. (p. 114)

The evidence demonstrated that students in small schools occupied a greater
proportion of central and responsible roles. 


Hypotheses 


The basic assumption in this study is that school activity settings in K-8 
schools are relatively undermanned in comparison to those in intermediate 
schools. In general, K-8 schools have fewer grade eight students than 
intermediate schools have. 

The general hypotheses of this study are as follows: 

1. Grade eight students in K-8 schools are more involved in school activities 
than grade eight students in intermediate schools. 

2. The responsibility for the maintenance of school activities assumed by grade 
eight students in K-8 schools is greater than that assumed by grade eight 
students in intermediate schools. 


Methodology 


Student Involvement Survey Instrument 


The instrument used to gather the data reported in this paper was the 
Student Involvement Survey (SIS). This instrument was based on the research 
done by Gump and Friesen (1964a). 


On the SIS, students are asked to report on their involvement in school 
activities under a number of categories. These categories are based on the 
varieties of nonclass behaviour settings identified by Gump and Friesen (1964). 
When completing the SIS, the students are instructed to think back over the 
past school year and recall their activities under each of the following twelve 
categories: 

1. Games with Other Schools 
2. House League, Intramural Sports 
3. Plays, Concerts, Public Performances 
4. Music 
5. Parties, Dances 
6. Money Raising, Charitable Projects 
7. Hobby Clubs, Special Interest Groups 
8. Service Clubs, Helper Groups 
9. Students’ Council, School Elections 
10. School Newspaper, Year Book 
11. School Offices 
12. Library, Resource Centre 


Each student lists the activities within the categories in which he or she was 
involved and indicates the nature of the involvement by placing a check mark 
under the appropriate Performance Information heading. The Performance 
headings on the SIS were derived from the different levels of penetration or 
zones of centrality defined by Barker (1968, p. 51). Three column headings giving 
three general levels of penetration were included on the instrument. Check marks 
in the first column indicate a relatively minor involvement at the audience or 
customer level. Check marks in the second column indicate a more active 
membership such as a worker or a helper while check marks in the third column 
indicate a participant who is more directly responsible for the maintenance or 
control of the activity, such as an organizer, leader, executive officer, or player. 


In addition to recording performance information, each student indicates the 
approximate number of times he or she was involved by checking the appropriate 
column in the Number of Times section of the instrument. This section of the 
instrument has three columns with the following headings: Once only, Two to 
five times, and Six or more times. 

In order to develop and refine the Student Involvement Survey form, a series 
of pilot studies was conducted. Prior to the first pilot study, a preliminary SIS 
form was discussed at a seminar group consisting of teachers and graduate 
students from the Faculty of Education at the University of Ottawa. As a result 
of the comments and suggestions from members of this group, the categories of 
activities on the instrument were defined more precisely and the format was 
modified. 

In Pilot Study Number One, the first draft of the SIS was used with 30 grade 
eight students—19 boys and 11 girls—in an intermediate school. The responses of 
these students were studied and a second draft of the instrument was drawn up. 
This second draft was checked for content validity at a second seminar of 
teachers and graduate students. As a result of this seminar presentation and 
subsequent discussions, the format of the SIS was changed further and additional 
performance information was included. The third draft of the SIS that came out 
of this process was administered to the 30 students who had completed the first 
draft. The interval between the administration of these two versions of the SIS 
was three weeks. On the first draft, the students indicated a total of 408 entries 
into nonclass behaviour settings in their school; on the third draft, the same 
students reported a total of 486 entries. 


The similarities between the responses of the students and between the total 
number of entries reported on the SIS indicated that measuring student 
involvement by asking individuals to recall their personal involvement was 
reasonably reliable. 


The third draft of the SIS was also used in Pilot Study Number Two. In this 
study, 34 grade eight students in another intermediate school were used. These 
students completed the third draft of the instrument on two different occasions 
eight days apart. The correlation between the total scores obtained on these two 
dates was .91. 
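
A test-retest coefficient of this kind can be computed as a correlation between the two sets of total scores. The sketch below assumes a Pearson product-moment coefficient, which the paper does not name, and uses made-up scores; `pearson_r` is a hypothetical helper, not part of the study.

```python
from math import sqrt


def pearson_r(x, y):
    """Pearson product-moment correlation between two equal-length score lists."""
    if len(x) != len(y) or len(x) < 2:
        raise ValueError("need two equal-length lists with at least two scores")
    mean_x = sum(x) / len(x)
    mean_y = sum(y) / len(y)
    # Sum of cross-products and sums of squared deviations from the means.
    sxy = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    sxx = sum((a - mean_x) ** 2 for a in x)
    syy = sum((b - mean_y) ** 2 for b in y)
    return sxy / sqrt(sxx * syy)


# Hypothetical SIS totals for five students on two occasions (not study data).
first_administration = [42, 57, 31, 66, 48]
second_administration = [45, 55, 35, 70, 44]
print(round(pearson_r(first_administration, second_administration), 2))
```

A coefficient near 1 means that students kept roughly the same relative standing on both occasions.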


In order to get a further indication of the validity of the SIS, two separate 
rankings of the students were derived from the scores obtained by the students 
on the two occasions on which they completed the instrument. These rankings 
were compared with a ranking of student involvement made by the homeroom 
teacher of these students. The homeroom teacher’s ranking was compiled by 
using a card-sort method. The correlation of the first set of rankings with the 
teacher’s was .47 and the correlation of the second set was .53. 
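
Agreement between two rankings of the same students is commonly summarized with a rank correlation. The sketch below assumes Spearman's coefficient and rankings without ties; the paper does not state which coefficient was used, and the rankings shown are invented for illustration.

```python
def spearman_rho(rank_a, rank_b):
    """Spearman rank correlation for two rankings of the same items (no ties)."""
    n = len(rank_a)
    if n != len(rank_b) or n < 2:
        raise ValueError("need two equal-length rankings of at least two items")
    # Sum of squared differences between paired ranks.
    d_squared = sum((a - b) ** 2 for a, b in zip(rank_a, rank_b))
    return 1 - (6 * d_squared) / (n * (n ** 2 - 1))


# Hypothetical rankings of six students: SIS-derived versus teacher card sort.
sis_ranking = [1, 2, 3, 4, 5, 6]
teacher_ranking = [2, 1, 4, 3, 6, 5]
print(round(spearman_rho(sis_ranking, teacher_ranking), 2))
```

The shortcut formula used here is exact only when neither ranking contains ties; tied ranks would call for a correlation computed on the ranks themselves.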


A fourth draft of the SIS was then prepared for use in Pilot Study Number 
Three. The changes incorporated in the fourth draft were minor. This version of 
the SIS was administered twice to 23 grade eight students—14 boys and 9 
girls—in a large K-8 school. The two dates of administration were nine days 
apart. The correlation between the total scores obtained by the students on the 
two dates was .94. 


The rankings derived from the two sets of scores were correlated with the 
homeroom teacher’s rankings. The correlations were .68 and .70 respectively. 


Following Pilot Study Number Three, a final version of the Student 
Involvement Survey form was prepared. Minor changes in the wordings of certain 
definitions were made, the Special Events category was incorporated into other 
categories, and a page with general instructions and a section for student data 
was added. 


Scoring the Student Involvement Survey Instrument 

In order to score the SIS, separate weightings were assigned to each of the 
three levels of penetration. The weightings were as follows: Level 1 was weighted 
1 and levels 2 and 3 were weighted 2 and 3 respectively. Similarly, the following 
weightings were given to the three columns in the Number of Times section: 
Once only received a weight of 1; Two to five times was weighted 2; and Six or 
more times received a weight of 3. The total of a student’s weighted performance 
scores and weighted number of times scores was deemed to be the measure of his 
or her involvement in school activities. The sum of all student totals for a given 
school or group of schools was the data that constituted the index of involvement 
for the school or group of schools in question. 


The number of times a student entered into activities at the different levels 
was considered to be a measure of the quality of the involvement and the degree 
of responsibility that the student assumed for the maintenance of the activity. 
The total number of entries into each of the three levels of penetration for a 
school or group of schools constituted the raw data for comparing the quality of 
student involvement between schools or groups of schools. 
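
The scoring rules above can be sketched as follows. The sketch assumes that each reported entry contributes its level weight plus its frequency weight to the student's total, which is one reading of "the total of a student's weighted performance scores and weighted number of times scores". The names `LEVEL_WEIGHTS`, `FREQUENCY_WEIGHTS`, `involvement_score`, and `entries_per_level` are hypothetical.

```python
# Weights as described in the text: three penetration levels, three frequency bands.
LEVEL_WEIGHTS = {1: 1, 2: 2, 3: 3}
FREQUENCY_WEIGHTS = {"once": 1, "two_to_five": 2, "six_or_more": 3}


def involvement_score(entries):
    """Total involvement for one student.

    `entries` is a list of (penetration_level, frequency_band) pairs, one per
    activity the student listed. ASSUMPTION: the two weights are added.
    """
    return sum(LEVEL_WEIGHTS[level] + FREQUENCY_WEIGHTS[band]
               for level, band in entries)


def entries_per_level(entries):
    """Count of entries at each penetration level (the 'quality' measure)."""
    counts = {1: 0, 2: 0, 3: 0}
    for level, _band in entries:
        counts[level] += 1
    return counts


# Hypothetical student: team player all year, one dance attended, helper a few times.
student = [(3, "six_or_more"), (1, "once"), (2, "two_to_five")]
print(involvement_score(student), entries_per_level(student))
```

Summing `involvement_score` over all sampled students in a school, or adding up the per-level counts, would give the school-level figures described in the text.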


The Research Population and Sample 


The research sample in this study was drawn from ten intermediate schools 
and ten K-8 schools in Eastern Ontario. 


At the time, the ten intermediate schools were all of the schools of this type 
in the region that had been in operation for two or more years. Six schools were 
located in the City of Ottawa and the adjacent suburban communities; the others 
were located in three relatively small cities within a 120-mile radius of Ottawa. 
Of these schools, nine housed grades seven and eight only and the remaining one 
had a small grade six in addition. The average enrolment of these schools was 
613, the smallest having a student population of 319 and the largest, 936. The 
number of students in the grade eight classes ranged from 158 to 558 with an 
average of 311. 

The ten K-8 schools included in the study were located in the same 
geographical region. The attendance area of each of these schools, with one 
exception, was the same for all grades including kindergarten. In the one school, 
the intermediate grade students were drawn from a wider attendance area. The 
average enrolment of these schools was 482, ranging from 235 to 850. The 
number of grade eight students ranged from 30 to 132, with an average of 61. 

A random sample of 15 boys and 15 girls was drawn from the grade eight 
students in each of the twenty schools. The total number of subjects was 597: 
297 in the intermediate schools and 300 in the K-8 schools. 
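
The sampling step for a single school can be sketched as below. The rosters and the `draw_school_sample` helper are hypothetical, and the paper does not describe how the random draw was actually carried out.

```python
import random


def draw_school_sample(boys, girls, per_sex=15, seed=None):
    """Draw a simple random sample of `per_sex` boys and `per_sex` girls.

    If a roster has fewer than `per_sex` names, everyone on it is taken.
    """
    rng = random.Random(seed)
    sampled_boys = rng.sample(boys, min(per_sex, len(boys)))
    sampled_girls = rng.sample(girls, min(per_sex, len(girls)))
    return sampled_boys, sampled_girls


# Hypothetical grade eight rosters for one school (identifiers are placeholders).
boys = [f"B{i:03d}" for i in range(1, 41)]
girls = [f"G{i:03d}" for i in range(1, 36)]
sample_boys, sample_girls = draw_school_sample(boys, girls, seed=1)
print(len(sample_boys), len(sample_girls))
```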


Analysis of the Data 

The major independent variable in this study was the type of school — K-8 or 
intermediate. The dependent variable was the participation of the grade eight 
students in school activities. 

The main purposes of the statistical analysis were to compare the level of 
participation in school activities of grade 8 students in the K-8 schools and the 
intermediate schools included in this study and to compare the degree of 
responsibility for the maintenance of these activities assumed by these two 
groups of students. 

To this end, a multivariate analysis of variance with nested factors was used. 
This test compared the levels of participation in the two types of schools with 
schools tested across the twelve school activities selected for this study and 
across the three levels of penetration. 

A post hoc univariate test was also used to indicate which of the activities 
and which of the levels of penetration accounted for the major differences 
between the two groups. 
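
For readers unfamiliar with the univariate F ratio, the sketch below computes a one-way analysis of variance F for a single activity. It is a simplification: it treats students as independent observations and ignores the nesting of schools within school type that the study's analysis modelled. The scores are invented and `one_way_f` is a hypothetical helper.

```python
def one_way_f(groups):
    """One-way ANOVA F ratio: between-group mean square over within-group mean square."""
    k = len(groups)
    n_total = sum(len(g) for g in groups)
    if k < 2 or n_total <= k:
        raise ValueError("need at least two groups and more observations than groups")
    grand_mean = sum(sum(g) for g in groups) / n_total
    means = [sum(g) / len(g) for g in groups]
    # Between-group sum of squares: group size times squared distance of its mean.
    ss_between = sum(len(g) * (m - grand_mean) ** 2 for g, m in zip(groups, means))
    # Within-group sum of squares: squared distance of each score from its group mean.
    ss_within = sum((x - m) ** 2 for g, m in zip(groups, means) for x in g)
    return (ss_between / (k - 1)) / (ss_within / (n_total - k))


# Hypothetical involvement scores in one activity for the two school types.
intermediate = [3, 5, 4, 6, 2]
k_to_8 = [8, 9, 7, 11, 10]
print(round(one_way_f([intermediate, k_to_8]), 2))
```

The F value is then compared with the critical value for (k - 1, N - k) degrees of freedom at the chosen significance level, .05 in this study.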

In all cases the MANOVA was carried out using the program of the 
Department of Educational Research Services Library of the University of 
Alberta, Canada (DERSLIB). 

The level of significance was set at .05. The Multivariate F ratios for the 
hypotheses indicated that the differences were significant (p<.001). 


TABLE 1 


F RATIOS FOR UNIVARIATE TESTS OF EQUALITY OF STUDENT 
INVOLVEMENT IN TWELVE SCHOOL ACTIVITIES 


School Activities                           Inter.        K-8           F             p 
                                            Mean          Mean 

 1. Games with Other Schools                4.4           9.7           [illegible]   .00 
 2. House League, Intramural Sports         18.1          [illegible]   [illegible]   .00 
 3. Plays, Concerts, Public Performances    6.5           6.6           [illegible]   [illegible] 
 4. Music                                   2.0           [illegible]   16.35         .00 
 5. Parties, Dances                         3.9           [illegible]   [illegible]   .02 
 6. Money Raising, Charitable Projects      6.4           6.9           [illegible]   [illegible] 
 7. Hobby Clubs, Special Interest Groups    [illegible]   [illegible]   [illegible]   .24 
 8. Service Clubs, Helper Groups            [illegible]   [illegible]   [illegible]   .00 
 9. Students’ Council, School Elections     [illegible]   3.0           [illegible]   .00 
10. School Newspaper, Year Book             [illegible]   [illegible]   [illegible]   [illegible] 
11. School Offices                          9.3           8.2           [illegible]   [illegible] 
12. Library, Resource Centre                4.9           [illegible]   0.8           .38 


The means under school types for each activity and penetration level are 
reported in Tables 1 and 2. The overall difference favours the K-8 schools. In general, 
therefore, the grade eight students in the K-8 schools surveyed were more involved 
in school activities and penetrated more deeply into these activities than their 
counterparts in the intermediate schools. 


TABLE 2 


F RATIOS FOR UNIVARIATE TESTS OF EQUALITY OF STUDENT 
PENETRATION INTO THREE LEVELS OF PENETRATION 


Levels of        Intermediate    K-8 
Penetration      Mean            Mean        F         p 

Level 1          8.66            7.80         7.67     .00 
Level 2          4.90            5.78        10.81     .00 
Level 3          6.58            7.86        20.32     .00 


In order to investigate student involvement further, a post hoc univariate 
analysis of variance was carried out in each of the twelve school activities. The 
univariate F ratios that are reported in Table 1 indicated that differences beyond 
the .05 level of significance in the direction of K-8 schools occurred in the 
following activities: Games with Other Schools; House League, Intramural Sports; 
Music; and Service Clubs, Helper Groups. At the same time, significant differences 
favouring the intermediate schools were found in the following activities: 
Parties, Dances; Students’ Council, School Elections; and School Offices. 

Student penetration into the activities was investigated further because the 
depth of penetration is related conceptually to the degree of responsibility for the 
maintenance of the activity assumed by the students. A post hoc univariate 
analysis of variance was carried out in each of the three levels of penetration. The 
F ratios given in Table 2 indicate that grade eight students in K-8 schools were 
more involved at the second and third level of penetration, while grade eight 
students in intermediate schools were more involved at the first level. In all cases, 
the level of significance was beyond the .05 level. 


Discussion 


Student Participation 


One of the assumptions of the study was that the level of student involvement 
in school activities is an indication of the school’s success or lack thereof in 
meeting the unique aims of intermediate education. 


The results of the statistical analyses support the hypothesis and indicate that 
the students in K-8 schools are generally more involved in school activities than 
those in intermediate schools. This finding bears out the results of previous 
behaviour setting studies on school size and student participation (Barker & 
Gump, 1964). It is in keeping with Willems’ (1974) claim that the rate of 
participation diminishes as school size increases. 


In addition to greater participation on the part of small school students, 
behaviour setting studies also indicate that small school students become involved 
in a wider variety of school activities (Gump & Friesen, 1964a). An inspection of 
the means reported in Table 1 shows that the students in the K-8 schools are 
involved as much as or more than their intermediate school counterparts in nine of 
the twelve activities included in the survey. This suggests that, in addition to 
being more involved generally, these students probably participate in a wider 
variety of activities. This is corroborated by the data gathered prior to the study 
from 12 of the 20 schools included in the study. These data showed that the six 
K-8 schools surveyed offered 315 activities while the six intermediate schools 
offered 314. These data bear out Barker and Barker’s (1964) contention that 
“small high schools are not so small on the inside” and lend support to the 
findings which indicate that the proportion of settings to the number of students 
is higher in small schools (Gump & Friesen, 1964a). 


The post hoc statistical analyses carried out in the twelve activities revealed 
that the grade eight students in the K-8 schools are more involved in Games with 
Other Schools, House League or Intramural Sports, Music Groups, and Service 
Clubs or Helper Groups. On the other hand, the grade eight students in 
intermediate schools are more involved in Parties and Dances, Students’ Councils 
and School Elections, and School Offices. 


Of the seven activities where differences occurred, the four which appear to 
have the greatest potential for active participation are in the direction of the K-8 
schools. Games with Other Schools and House League Sports were defined on the 
Student Involvement Survey (SIS) as indoor and outdoor sports including team 
and individual sports. Generally speaking, sports at the elementary level are not 
spectator sports. Therefore, it is probably reasonable to conclude that most of the 
involvement in these activities is at an active performance level. In designing the 
SIS, it was presupposed that participation in Music Groups and Service Clubs had 
to be at an active level and, therefore, the first level of involvement was not 
included on the instrument. Consequently, the student responses in the Music and 
Service Club sections of the instrument are all at the active level. 


On the other hand, two of the activities that show a difference in the direction 
of the intermediate schools probably have more potential for passive onlooker 
roles. Parties and Dances are usually large group activities, especially in 
intermediate schools, where the majority of students participate at the 
attendance level. Attendance was placed in the lowest level of involvement on the 
SIS because it implied little or no requirement for exercising personal initiative or 
responsibility. The significant difference for this activity in the direction of 
intermediate schools may be attributable to the relatively large number of 
students in these schools. Similarly, the difference in involvement between 
intermediate schools and K-8 schools in Students’ Councils and School Elections 
may be a function of the fact that the majority of participants are in the voter 
category—the lowest level of involvement. This majority is proportionately larger 
in the intermediate schools because the constituencies in these schools are 
considerably larger. 


In summary, the study revealed that the grade eight students in the K-8 
schools in the sample are generally more involved in school activities than the 
comparable students in the intermediate schools in the sample. 


Student Responsibility 


The examination of the pattern of involvement of students from the two 
different types of schools suggests a difference in the quality of the involvement as 
well. The question of quality is also related to the concept of penetration. The 
depth of an individual’s penetration into a setting is considered to be an 
indication of the individual’s sense of obligation or responsibility for the 
maintenance of the setting. It was predicted that grade eight students in K-8 
schools would assume a greater responsibility for the maintenance of school 
activities than grade eight students in the intermediate schools. 


The results of the statistical analyses indicate that the students in the K-8 
schools penetrate school activities more deeply than the students in the 
intermediate schools. This finding suggests that the students in the K-8 schools 
assume a greater degree of responsibility for the activities in which they 
participate and probably feel a greater sense of obligation than the intermediate 
school students. 

The post hoc tests on the three levels of penetration reveal that the 
intermediate students are more involved in school activities at the first level of 
penetration while the K-8 students are more involved at the second and third 
levels. These findings give added support to the hypothesis. They are also in 
keeping with the findings reported in the literature—students in small schools 
hold more positions of responsibility in a wider variety of activities than students 
in large schools (Gump & Friesen, 1964a) and leadership achievement is higher 
among students from small schools (Baird, 1969). 

The fact that small school students assume more positions of responsibility is 
explained by the concept of undermanning (Wicker, 1968). The data collected on 
activities in 12 of the schools in the study lend support to the undermanning 
theory. The ratio of activities to students for the intermediate schools surveyed 
was .2:1 while the ratio for the K-8 schools was almost 1:1. In other words, far 
fewer students are available to maintain the activity settings in the K-8 schools. 
Consequently, many more students are involved in responsible roles in these 
schools. 
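
The arithmetic behind an activities-to-students ratio can be sketched as follows. The activity counts (315 and 314) are those reported earlier for the twelve schools surveyed. The student counts are an assumption: the paper does not give grade eight enrolments for those twelve schools, so the sketch substitutes six schools at the ten-school average enrolment of each type (61 and 311). Under that assumption the ratios come out at roughly 0.86:1 and 0.17:1, the same order of magnitude as the "almost 1:1" and ".2:1" reported.

```python
def activities_per_student(n_activities, n_students):
    """Number of activity settings available per student."""
    if n_students <= 0:
        raise ValueError("n_students must be positive")
    return n_activities / n_students


# Activity counts reported for the twelve schools surveyed before the study.
K8_ACTIVITIES = 315
INTERMEDIATE_ACTIVITIES = 314

# ASSUMPTION: six schools of each type, each with the average grade eight
# enrolment reported for all ten schools of that type (61 and 311).
K8_GRADE_EIGHT = 6 * 61
INTERMEDIATE_GRADE_EIGHT = 6 * 311

k8_ratio = activities_per_student(K8_ACTIVITIES, K8_GRADE_EIGHT)
intermediate_ratio = activities_per_student(INTERMEDIATE_ACTIVITIES,
                                            INTERMEDIATE_GRADE_EIGHT)
print(f"K-8: {k8_ratio:.2f}:1, intermediate: {intermediate_ratio:.2f}:1")
```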

In summary, the study shows that grade eight students in the K-8 schools in 
the sample penetrate school activities more deeply than their counterparts in the 
intermediate schools in the sample. Furthermore, the examination of the 
involvement at the different levels of penetration indicates that the students in the 
K-8 schools occupy more positions of responsibility in school activities which, in 
turn, gives support to the idea that students in small schools generally feel a 
greater sense of obligation and responsibility for these activities. 


Conclusion 


These findings have implications for intermediate education if the students at 
this level are to have real opportunities to assume responsibility and to make 
decisions, the prerequisites for developing independence and individuality 
(Desjarlais & Rackauskas, 1975). The results call into question the efficacy of 
intermediate schools in fostering student participation in school activities. This 
could be an educational issue of some import because many intermediate schools 
exist and more will undoubtedly be established. School planners should take into 
account the effect of size on behaviour when new intermediate schools are being 
considered. This effect should be balanced against the potential advantages of the 
more specialized staff and facilities that larger schools generally make possible. 


There is a need for more research on student involvement. The possible effects 
on involvement of the different rates of maturation of boys and girls during their 
early adolescence could have important implications for school organizational 
patterns. Another question that might be investigated is the influence that the 
number of years in a school has on student involvement. This factor could favour 
K-8 schools where students are normally resident for more years than their 
counterparts in intermediate schools. The greater range of grades in the K-8 
schools and the closer association of these schools with their communities may 
give rise to more opportunities for student involvement in helper roles and 
community activities. These questions could be looked at in conjunction with a 
more detailed investigation of the intermediate school environment and its effect 
on student behaviour. 

Clearly defined academic programs and well-trained, sensitive teachers are, of 
course, of paramount importance in schools, large or small. These factors will 
always be central to the success of schools. However, the matter of student 
involvement in school activities is also important, especially at the intermediate 
level. In analyzing the implications of the research by Spady (1971), Erickson 
(1977) puts the case for school activities thus: 

The extracurriculum, having a down-to-earth, here-and-now quality, featuring 
activities often consequential to both student and adult communities and demanding 
attributes not essential to classroom functioning, may provide a superior learning 
environment, particularly for attitudes and skills needed in college and later 
occupational endeavors. (p. 13) 

In terms of the research presented in this paper the matter comes down to a 
question such as this: “Is the little fellow in the grade eight class more likely to be 
on the school baseball team if he is in a K-8 school?” 


The answer is: “Yes, probably.” 


ERRATUM 


Volume XXVI Number 1, March, 1980 issue of AJER: Predicting the 
Educational Bias Effect by G. G. Foster, B. Algozzine and R. C. Eaves should 
read: G. G. Foster, B. Algozzine, R. C. Eaves and M. H. Diebold. Dr. Diebold 
participated in the planning, data collection phase, and manuscript editing of 
the study. Drs. Algozzine, Foster and Eaves sincerely regret and take 
responsibility for the oversight. (Martin H. Diebold is Assistant Professor of 
Special Education at Auburn University, Auburn, Alabama.) 


Patterns of Work Values Held 
by Secondary School Girls 


The work values of 541 girls registered in an urban secondary school were 
measured using Super’s Work Values Inventory. It was concluded that these 
girls hold work values which differ very little from grade to grade. Generally, 
the girls tended to value security, achievement, and the “good life,” and to 
value least management, intellectual stimulation, and esthetics. The results 
demonstrate the need for educators to identify the values of their students as 
they attempt to understand more fully the reasons for the students’ behaviour. 


Two major factors lay the foundation for an educator’s greater understanding 
of learners’ functioning level: a knowledge and understanding of learners’ level of 
development, and sensitivity to their value system. Such awareness will also serve 
as a guideline for predicting the learners’ future development and behaviour. 

Recent professional literature is replete with studies describing the various 
value systems held by adolescents. This paper attempts to provide additional 
insight into one specific set of values: the work values held by adolescents. Such 
insight is necessary if individuals involved in helping youth are to assist them in 
identifying and understanding the implications that their value orientations may 
have in relation to their educational and vocational planning. 

Green (1946) contended that an individual’s complete value system must be 
considered by the helping professions. Consequently, the values held by young 
learners must be a matter of concern for any practitioner involved in the area of 
assisting them (Super, 1970). This is especially true if Newton’s (1967) observation 
is correct—that the school presents an environment which practically demands 
that adolescents conform to external standards. Educators who introduce values 
clarification procedures to the adolescent, especially to one who lives in an 
environment which has a set of values dissimilar from those of the educators, will 
produce a frustrating effect (Thomas, 1974). This frustration may, however, be 
somewhat softened. Domenichetti (1970) reports evidence suggesting that many 
diverse work values held by adolescents change continuously, surfacing and 
disappearing at various stages of their growth and development. 


Education and work values are difficult to separate. Both are still prized in our 
western society. The right to decide the type of work one will engage in, and for 
whom and when one will work, is a freedom which can be exercised if the 
individual wishes to do so. This potential of choice is highly valued. 

Williamson (1959) notes that in order for individuals to develop a clearer 
perspective of their occupational development, vocational calling, or career goals, 
they must be aware of their personal work values. 

A need exists then for helping agents to identify and understand the work 
values held by these young people, and to examine the effect of their value 
systems upon their various actions. The present paper is addressed to similarities 
and differences in work values as a function of grade level. It is hypothesized that 
the work values held by girls in Grades 9-12 will show a developmental trend. 


Method 


Subjects 

The sample consisted of 541 female students whose ages ranged from 14-20 
years and who were enrolled as full-time day students in a southwestern Ontario 
urban secondary school. This sample accounted for 96% of the total female 
population; the 51 male students were not included in the study due to the small 
population. Table 1 indicates the distribution of the 541 students. 


TABLE 1 


DISTRIBUTION OF SUBJECTS 


Grade Level     Sample Size (N) 
 9              145 
10              14 
11              126 
12              [illegible] 


Students attending the school tended to come from below average income 
families; furthermore, the majority of these students specialized in business and 
commerce-related studies, and no more than 10 percent continued their education 
beyond Grade 12. 


Procedure 


Super’s (1970) Work Values Inventory (WVI) was administered to the total 
school population in early May, 1973. Although the participants scored their own 
WVI, the senior author examined all recorded scores for error. Data produced by 
this administration were rank-order correlated among all grade levels for the 15 
WVI variables. In addition to the average rank obtained by this procedure, the 
total absolute deviation and amount of deviation from the mean were calculated 
for each of the 15 values. 
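
The deviation measures can be sketched for a single value as follows, using the definitions given in the notes to Table 3: a group's deviation is the average rank minus that group's rank, the total absolute deviation is the sum of the absolute deviations, and the proportion of variation is the largest deviation divided by that total. The sketch assumes the average rank is the unweighted mean of the four grade-level ranks, which the paper does not spell out; the ranks shown are invented and `deviation_summary` is a hypothetical helper.

```python
def deviation_summary(ranks_by_grade):
    """Average rank, largest deviation, total absolute deviation, and proportion.

    `ranks_by_grade` maps grade level to the rank that grade gave one work value
    (1 = most valued). ASSUMPTION: the average rank is an unweighted mean.
    """
    average_rank = sum(ranks_by_grade.values()) / len(ranks_by_grade)
    # Positive deviation: the grade ranked the value higher (nearer 1) than average.
    deviations = {grade: average_rank - rank
                  for grade, rank in ranks_by_grade.items()}
    total_absolute = sum(abs(d) for d in deviations.values())
    grade, largest = max(deviations.items(), key=lambda item: abs(item[1]))
    proportion = abs(largest) / total_absolute if total_absolute else 0.0
    return {
        "average_rank": average_rank,
        "group": grade,
        "largest_deviation": largest,
        "total_absolute_deviation": total_absolute,
        "proportion_of_variation": proportion,
    }


# Hypothetical ranks given to one work value by Grades 9 to 12 (not study data).
summary = deviation_summary({9: 2, 10: 6, 11: 3, 12: 3})
print(summary)
```

In this invented case Grade 10 deviates by -2.5 from an average rank of 3.5 and accounts for half of the total absolute deviation of 5.0.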


Results 


The matrix resulting from the rank-order correlation of the 15 WVI variables 
is reported in Table 2. Table 3 makes it possible to discern which group of 
students deviated most from the average for each of the 15 WVI variables. 


TABLE 2 


CORRELATION MATRIX 


Grade      9       10      11      12 

  9       1.00 
 10        .85    1.00 
 11        .89     .85    1.00 
 12        .96     .82     .91    1.00 


The hypothesis of developmental trends among work values received mixed 
support. Upon a review of the data, it is evident that of the 15 WVI variables 
only six can be classified as showing major differences. The balance were 
essentially similar for all grade levels. 

A value showing a deviation of ±2.0 or more was considered to be different. A value 
deviating by this amount, coupled with a proportion of the total absolute 
variation of over 0.4, was considered to be even more meaningful. Of the six 
values classified as different, five appeared with a proportion of variation greater 
than 0.4. 

It may be useful to consider the grade by grade breakdown of differences. 
The average ranks of the values for the ninth-grade girls did not deviate from 
the overall average on any variable by enough to be viewed as different from 
those of girls in any other grade. (The three such values reported in Table 
3 are all less than the required absolute value of 2.00.) 

TABLE 3 


TABLE OF DEVIATIONS 


Work Values                Group(s) With                                   Total 
Inventory                  Largest         Average       Largest           Absolute          Proportion of 
Variable                   Difference      Rank          Deviation (a)     Deviation (b)     Variation (c) 
                           (1)             (2)           (3)               (4)               (5) 

Altruism                   11              4.75          [illegible]       5.0               0.35 
Esthetic                   [illegible]     14.25         -0.75             1.5               0.5 
Creativity                 10              [illegible]   [illegible]*      7.0               0.43 
Intellectual Stimulation   11              [illegible]   [illegible]*      4.0               0.5 
Achievement                9               3.25          -1.75             [illegible]       0.5 
Independence               10, 11          [illegible]   -0.75             2.0               0.38 
Prestige                   11              10.5          [illegible]*      [illegible]       0.52 
Management                 11              14.75         [illegible]       [illegible]       0.5 
Economic Return            12              4.62          +2.62*            7.5               0.35 
Security                   12              [illegible]   -1.75             5.0               0.35 
Surroundings               9, 10           7.0           ±1.00             2.0               0.5 
Supervisory Relations      10              3.38          +2.38*            4.74              0.5 
Associates                 9, 10           8.5           [illegible]       4.0               0.38 
Way of Life                10              2.0           -3.00*            7.0               0.43 
Variety                    10              9.25          [illegible]       [illegible]       0.5 


* Indicates values classified as different. 

(a) The ranking of the indicated group can be obtained by subtracting the value in column (3) from 
the value in column (2). 

(b) This column is the sum of the absolute values of all the differences from the value in column (2). 

(c) This value is obtained by dividing the value in column (3) by the value in column (4). 


Three of the six values with large differences can be attributed to 
the Grade 10 girls: Creativity, Supervisory Relations, and Way of Life. These 
values also accounted for enough of the proportion of variation to make them very 
meaningful: in each case the Grade 10 deviation represented more than 40% of all 
the variation for that value. Creativity and Supervisory Relations were ranked 
above the all-grade average by the Grade 10 group, reflecting an increased 
desirability of these values for that group; Way of Life showed the opposite 
deviation, reflecting a lower desirability. 

It was observed that the group of Grade 11 girls differed from the others by 
having the largest deviation from the average and the highest proportion of 
variation on two values, Intellectual Stimulation and Prestige. Both values were 
meaningfully lower than the average rank. 

The Grade 12 students were above average on one value, Economic 
Returns. However, this value showed some fluctuation from grade to grade, and 
the Grade 12 group’s +2.62 difference accounted for only about one-third of this 
fluctuation. 
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Average ranking of the values should also be considered. The highest ranked 
value was observed to be Way of Life, with Achievement and Security holding 
second and third place. At the other end of the scale, the values Intellectual 
Stimulation, Esthetics, and Management, respectively, held the last three 
positions. 


Discussion 


The correlations presented in Table 2 are all sufficiently large to suggest
that the 15 WVI variables are closely related. This observation suggests that
these girls hold work values which differ very little from grade to grade. The
Grade 10 group, however, appears to be somewhat removed from the other groups in
the things they value. This has led to further investigation of this research
data.¹


On a grade by grade breakdown, the Grade 9 girls do not deviate from the average.
The Grade 10 girls seem to be showing signs of self-actualizing concerns by their
higher rating of Creativity and by the lowered value of Way of Life. This event is
somewhat anomalous, but has been explained elsewhere as a counselling effect.
The Grade 11 girls seem to be rejecting the esteem-oriented variables of
Intellectual Stimulation and Prestige, suggesting an affiliation orientation on the
Maslovian Hierarchy. Finally, the Grade 12 girls, only two months from graduation,
reflect a practical note by rating Economic Returns higher than the rest.


The general picture would seem to imply that many of the students, 
especially those in Grade 11, do not desire work which entails much thought; in 
addition, they are not overly concerned with how or why things work. Consistent 
with this trend is the noticeable fact that these girls are not esthetically 
motivated. Also, because Management is least valued, it becomes more obvious 
that the majority of the students are not desirous of a position of authority. 
Levine (1971) may have indicated the basis for this trend: 


Mitchell (1968) reported that low achievement press was independently associated 
with socioeconomic status and proportion of students expressing a desire for college 
training, thus suggesting that the “tone” of low income schools as well as the attitudes 
prevalent among their students may have a depressing effect upon educational 
aspirations. (p. 109) 

It is generally accepted that educators in a secondary school setting ought to 
know their clientele. In the present study, the work values held by girls were 
examined. These girls are concerned with security and the “good life” and tend to 
value management, intellectual stimulation, and esthetics least. This raises serious 
questions related to educators understanding the values held by their students as 
a basis for understanding the various kinds of behaviour exhibited by the 
students. These are questions which must be considered not only by educators but
by institutions which prepare them for helping roles.


Notes 


1. [...] that subgroups of the Grade 10 group who received counselling one year earlier showed
significantly distinctive patterns of work values (see Wearne and Powell, 1977).


This paper presents data on ethnic schools in three Canadian metropolitan
centres, dealing with selected characteristics such as social background and
attitudes toward ethnicity of their personnel, curricular and instructional
emphases and practices, and institutional and organizational features. Using 
these descriptive data, it then discusses some of the indirect but probably more 
important functions of these schools with respect to ethnic socialization and 
development of ethnic identity of their students. The discussion is used to 
formulate and suggest some research questions related to the broader role 
ethnic schools might play in future under conditions of cultural pluralism. The 
paper is intended to serve as a corrective to the relatively narrow focus 
implicit in discussions on the role of the schools in teaching ethnic languages 
and, at the same time, to stimulate the interest of educational researchers in 
the study of ethnic institutions. 


Interest in ethnic schools has been stimulated by a resurgence of ethnicity 
here and abroad. Ethnic diversity and its implications for political integration and 
stability within nation-states have received considerable scholarly as well as 
political attention during recent years (Bell & Freeman, 1974). Canada has 
developed into a multi-ethnic society, particularly since World War II. According 
to the 1971 census, the Canadian population was distributed as follows: 44.6 
percent were of British extraction; 28.7 percent, of French extraction; and 27.5 
percent, of approximately 30 other ethnic origins.¹ The Royal Commission on
Bilingualism and Biculturalism was established in 1963 by the Federal 
Government to deal with problems related to this ethnic diversity. 


Raj Pannu is a Professor and John Young an Associate Professor in the Department of Educational Foundations,
both holding joint appointments in the Department of Sociology. In addition to aspects of multiculturalism, their
research interests include education and development in the Third World and sociology of education.


R. S. Pannu and J. R. Young 


Book IV of the report of the Commission examined the question of cultural 
and ethnic pluralism in Canada and the status of various ethnic cultures and 
languages. One question the Commission attempted to answer was, to what degree 
have non-British, non-French ethnic groups remained attached to their original 
cultures and languages? (Canada, RCBB IV, 1969). Subsequently, the Federal 
Government accepted and endorsed the recommendations and spirit of Book IV of 
the Report of the Royal Commission on Bilingualism and Biculturalism and 
thereby adopted its policy of multiculturalism on October 8, 1971. To implement 
this new policy of multiculturalism, the Cultural Development Program in the 
Department of Secretary of State was instituted to emphasize, in particular, 
research on language, language retention and cultural development, and the role of 
organizations such as educational institutions in language and cultural retention 
and development. Since the publication of this Report, the phenomenon of 
language and cultural maintenance in Canada has emerged as an important area 
of investigation in the social sciences. 


The major objectives of this paper are two-fold: firstly, to look at the role of 
language retention in cultural maintenance, and secondly, to present data 
regarding some selected characteristics and probable functions of ethnic schools in 
Canada. In addition, it is hoped that the paper will draw the attention of Canadian
educators to these schools as an important area of research. Data for this paper 
are drawn from a study (Pannu & Young, 1976), funded by the Department of 
Secretary of State, which was intended to examine the role of ethnic schools in the 
teaching and maintenance of ethnic languages. 


Language Retention and Cultural Maintenance 


Prior to the early sixties, studies of immigrant populations tended to 
emphasize processes of assimilation and integration and problems of disappearing 
ethnic identities in the cultural melting pots of the host societies (Elliott, 1971). 
Much less work had been devoted to studying the processes whereby ethnic groups 
resisted total assimilation. More specifically, the processes of cultural maintenance 
and language retention had been neglected in research on immigrants. This 
neglect of the phenomenon of ethnic survival and continuity may have been the 
result of essentially ideological presuppositions underlying the theoretical
perspective of studies of ethnicity (Richmond, 1969). 


In Canada, however, this notable imbalance in social science research has been 
redressed in recent years. For instance, in the past decade the number of studies 
of the various social and cultural factors which contribute to cultural maintenance 
and ethnic identity has notably increased (Berry, Kalin, Taylor, Lamarche & 
Christian, 1975; Drieger, 1973, 1974; Drieger & Church, 1974; Drieger & Peters, 
1973; Fathi & Kinsley, 1968; Gold, 1973; Lieberson, 1970; Migas, 1975; Millet, 
1975; Palmer, 1972; Reitz, 1974). 

Studies of the Canadian social structure (Clements, 1975; Porter, 1965) have 
helped bring to notice and underline the fact of the continuity of ethnic diversity 
and its unchanging character by pointing to its demonstrable manifestations in the 
patterns of social stratification. However, as early as 1960, one Canadian study of 
ethnic communities in Montreal (Breton, 1964) focused on features of their social 
organization (the degree of institutional completeness or self-sufficiency) which
enabled each ethnic community to attract the immigrant into its social boundaries 
and thereby resist assimilation of the ethnic group into the host society. This 
study drew attention to the fact that the degree of institutional completeness of 
an ethnic community was an important factor in determining whether the 
immigrants would socially integrate in the host society at large, would be attracted 
to another ethnic community, or would integrate into the community of their 
ethnicity. In addition to colour and religion, a high degree of institutional 
completeness of an ethnic group was found to be prominently related to its 
distinctive language. The dependence of the survival of an ethnic community on 
these two factors suggests that there may be a critical link between an ethnic 
group’s access to its distinctive language and its cultural survival, although this 
relationship may be mediated by the degree of institutional completeness. Other 
studies tend to bear this out (Drieger & Church, 1974). 


Further, Richmond’s study (1972) of immigrants and ethnic groups in Toronto 
focused on those features of ethnic group life and social organization which are 
related to the group’s efforts at cultural maintenance. The study of language and 
ethnic relations in Canada conducted by Lieberson in 1966, the research and other 
materials included in the Report of the Commission on Bilingualism and 
Biculturalism (Canada RCBB IV) and a few recent studies on non-official and 
official languages in Canada conducted under the sponsorship of the Department 
of Secretary of State following the adoption of the policy of multiculturalism by 
the Federal Government (Berry et al., 1975) have examined certain aspects of the 
processes of language and cultural maintenance in Canada. It should be noted, 
however, that although these studies have examined several aspects of cultural 
maintenance, few have focused directly on institutional mechanisms of language 
maintenance and cultural identity per se. 


Institutional Mechanisms of Language Retention 

The language maintenance efforts of ethnic groups in Canada, as well as the 
United States, have taken several forms. The two best known are (a) the
ethnic school and (b) the ethnic media and press. In the United States, Fishman’s 
research on the ethnic schools (1966) was a pioneering venture which sought to 
focus on two dimensions of the phenomenon. It sketched the main organizational 
and institutional features of the schools run by or on behalf of ethnic groups in 
the United States: their size, auspices, faculty, student body, curricula, etc. From 
these descriptive data, Fishman derived the language maintenance implications of 
these schools. He concluded that the ethnic group school was the most active 
language maintenance institution in the majority of ethnic communities. These 
schools succeeded in reinforcing or developing moderate comprehension, reading, 
and speaking facilities in their pupils. The level of facility attained in the 
languages by pupils was adequate to provide a foundation for language and 
cultural maintenance in the ethnic communities. 

In Canada, ethnic schools have been functioning for several decades. However, 
little was known about these schools until Krukowski and McKellar (1966) 
completed a study on Canadian ethnic schools for the Royal Commission on 
Bilingualism and Biculturalism. The study included 585 such schools run by or on 
behalf of 21 different ethnic groups. Based on a questionnaire survey, this study
included information on the schools regarding sponsorship, location, financial 
resources, curriculum, teaching staff, textbooks, and common problems. According 
to the B & B Commission Research Director, the study included useful 
information. Nevertheless, it was incomplete and the quality of the descriptive 
data inadequate. Volume III of the Report of the Commission on the Position of 
French Language in Quebec (more frequently referred to as the Gendron Report) 
dealing with ethnic groups, examined the question of ethnic groups in Montreal. It 
directed specific attention to “ethnic schools, their significance, the role they play 
in the position of French language in Quebec, that is, whether they made the 
knowledge and use of French easier, and to what extent they could promote these 
features” (Government of Quebec, 1972, p. 248). It provided very brief descriptive 
information about such schools with no information on curriculum, textbooks, or 
institutional organization. 

More recently a study of ethnic language instruction was undertaken in 
Toronto in 1973, the purpose of which was “to co-ordinate information regarding 
‘third’ language schools in one document embracing all cultural communities in 
Metropolitan Toronto” (Canada, Secretary of State, 1973, p. 1). This document 
was restricted to a presentation of the names, addresses, ethnic affiliation and size 
of 265 ethnic schools in Toronto. 

The Non-official Languages Study (O’Bryan, Reitz & Kuplowski, 1974) also 
indirectly dealt in part with one aspect of ethnic schools in third language 
retention. It examined the degree of support among different ethnic groups for 
these schools and found widespread support of the ethnic school for all groups, 
although the support varied greatly from one ethnic group to another. 


These studies notwithstanding, the general lack of knowledge and interest of 
Canadian social scientists and educational researchers regarding the existence, 
nature and role of these schools, continues more or less undiminished. Borrowing 
Fishman’s words in order to describe the situation in this country, the 
supplementary language schools in Canada have remained “largely terra 
incognita.” The study of these schools calls for the sustained attention of 
Canadian scholars not only because the schools represent an important facet of 
Canadian social and educational history, or because the results of research may 
provide useful inputs to policy formulation related to the federal government’s 
policy of multiculturalism, but also because they may serve as sources of valuable 
data relevant both to the study of ethnicity and to an aspect of Canadian 
education hitherto unexplored. 


Research Procedure


This study focused exclusively on large urban centers in Canada west of the 
province of Quebec whose populations included sizeable numbers of individuals of 
non-English or non-French ethnic origins.² Urban centers were selected because
conscious efforts by ethnic groups for the maintenance of their traditional 
languages are known to be concentrated in such community settings (Fishman, 
1966). 

As to the selection of specific urban centers, the decision was made to include
three cities located in different regions of Canada which contained significant 
numbers of individuals of “other” ethnic groups. As a result, the cities of Toronto, 
Edmonton, and Vancouver were selected. Specific ethnic groups included in the 
sample from each city were as follows: 


Toronto        Edmonton       Vancouver
Chinese        Chinese        Chinese
Greek          German         German
Hungarian      Hungarian      Greek
Italian        Italian        Italian
Jewish         Polish         Ukrainian
Polish         Ukrainian
Ukrainian


As can be seen, a total of eight different ethnic groups were selected, and each
group except the Jewish group was represented in more than one city.


Data Collection 


Preliminary field work by the researchers consisted of a census of 
supplementary language schools within the three cities. From this census a total of 
142 supplementary language schools were identified. The actual data collection, 
completed in two phases, was conducted by the Survey Research Centre, York 
University. 


TABLE 1


NUMBER OF ETHNIC LANGUAGE SCHOOLS AND RESPONSE RATE
FOR MAIL-OUT SURVEY (Phase One)


                  Edmonton                Toronto                 Vancouver
              Schools  Completed      Schools  Completed      Schools      Completed
Chinese          2         0             5         3          [illegible]      1
German          11         9          not included            [illegible]      4
Greek         not included               5         2              4            3
Hungarian        1         1             3         2           not included
Italian          2         2             1         1              6            1
Jewish        not included              34        15           not included
Polish           1         1            12         9           not included
Ukrainian       14         9            25        11              6            4
TOTAL           31        22            85        43             26           13


Phase One of the study utilized a questionnaire mailed to the directors or
principals of the 142 schools. Of this number, 78 responded to and completed the
questionnaire.³ The questionnaire was designed to elicit information about various
aspects of the schools such as organizational structure, financial support, 
personnel, and curriculum. 


Phase Two of the data collection consisted of structured interviews. In this 
phase, 167 respondents out of a total of 204 identified were interviewed, yielding a 
response rate of 82%.⁴ Interviews were conducted in the language of choice of the
respondents. Questions of a qualitative nature such as attitudes toward ethnicity 
and teacher morale were included in the interview questionnaire. 
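
The response rates for the two phases follow directly from the counts reported above. The sketch below is illustrative only; the function name is hypothetical and the counts are those stated in the text (78 of 142 schools, 167 of 204 respondents).

```python
def response_rate(completed, contacted):
    """Percentage of contacted units that completed the instrument."""
    return 100.0 * completed / contacted


phase_one = response_rate(78, 142)   # mailed questionnaire, unit = school
phase_two = response_rate(167, 204)  # structured interview, unit = respondent

print(f"Phase One: {phase_one:.1f}%")
print(f"Phase Two: {phase_two:.1f}%")
```

The Phase Two figure rounds to the 82% reported; the Phase One rate, which the text does not state as a percentage, works out to roughly 55%.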


TABLE 2


RESPONDENTS SELECTED FOR INTERVIEW AND COMPLETION RATES
(Phase Two)


                    Edmonton                Toronto                 Vancouver
Ethnic Group    Selected                Selected                Selected
                People    Completed     People    Completed     People       Completed
Chinese         not included               6         5          [illegible]      3
German            27        25          not included            [illegible]     12
Greek           not included               9         6          [illegible]      6
Hungarian          3         3             5         5           not included
Italian            4         3            15         6              6            3
Jewish          not included              34        33           not included
Polish             3         3             6         3           not included
Ukrainian         29        28            15        13             11           10
TOTAL             66        62            90        71             48           34


Findings


Institutional Characteristics 


Ethnic schools are not a recent occurrence. Of the schools sampled, over 70% 
were in existence prior to 1965. This was most evident for German and Ukrainian 
schools. Of more recent origin were Greek schools, 60% of which were established 
during 1970-72, and Italian schools, 50% of which were established during the 
same time period. 

In terms of the original sponsorship of the schools, the most common sponsor 
was the church or associated religious personnel: 46.2% of all schools were 
initiated by priests, ministers, rabbis, or church groups. This was particularly true 
for Hungarian and Chinese schools and least likely for Greek schools. The next 
most common sponsors of the schools were ethnic-cultural organizations (25.6%) 
and the third most frequently reported sponsors (12.8%) were parents’ groups. 

While churches were the original loci of ethnic schools, the trend now is
toward the use of public school facilities. At the time of formation, for ethnic 
groups as a whole, 61% had their schools located in church or religious centers; in 
1974, approximately 35% of ethnic schools were located in public or separate 
school buildings. Approximately 90% of the ethnic schools were affiliated with 
some other institution or organization at that time, the most common affiliation 
being with church groups (44.2%) followed by ethnic-cultural organizations 
(19.5%). However, 40% of the Greek schools and 23% of the German schools were 
not affiliated to or sponsored by formal organizations. 


TABLE 3


NUMBER OF STUDENTS AND TEACHERS
IN ETHNIC LANGUAGE SCHOOLS


                                                      Student/
                                                      Teacher
              Schools      Teachers     Pupils        Ratio

Chinese          4            25           239          9.6
German          13            71           848         11.9
Greek            5            11           527         47.9
Hungarian        3            18           305         16.9
Italian          4           129         3,507         27.2
Jewish          15           359         4,419         12.3
Polish          10        [illegible]  [illegible]  [illegible]
Ukrainian    [illegible]     184       [illegible]  [illegible]


TOTAL           76           837        13,301

All ethnic groups utilize more than one source for funding their activities. 
Tuition fees charged students were an important source of funding for 90% of all 
schools. In addition to tuition fees, schools were funded by church or community 
organizations. All the Hungarian, Italian and Jewish schools charged a tuition fee. 
However, in 30% of Polish schools, 25% of Chinese schools, 20% of Greek 
schools, and 15.4% of the German schools, students were charged no tuition fees. 
As to the amount spent per student per year on textbooks and other instructional 
materials, the average was about $13 per student per year. By ethnic group, it was 
highest for Jews ($27) and Ukrainians ($14) and lowest for Germans ($2.40) and 
Hungarians ($3). 

In the 76 schools sampled, the total student enrolment was 13,301 students 
with an average of 175 students per school. The largest average enrolment was for 
Italians with an average of 877 students per school followed by Jews with an 
average of 295 students; Ukrainians, 110; Greeks, 105; Poles, 102; Hungarians, 101.
Smallest average enrolments were reported for the Chinese schools with an 
average of 60 students followed by the German with an average of 65 students. It 
is significant to note that the range of enrolments in the sampled schools was 
extensive, from a low of 9 to a high of 2,860 students. Regarding student/teacher
ratios, Greek schools had the highest student/teacher ratio of 48 to 1. For the 
remaining groups the number of students per teacher was as follows: Italian, 27; 
Polish, 26; Hungarian, 17; Ukrainian, 13; Jewish, 12; German, 12; and Chinese, 10. 
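
The ratios and averages quoted above are simple quotients of the counts in Table 3. As a minimal sketch, using only rows whose figures are clearly legible in the table and hypothetical names throughout, the calculation can be reproduced as follows.

```python
# (teachers, pupils) copied from the Chinese, German, and Hungarian rows of Table 3
table3_rows = {
    "Chinese": (25, 239),
    "German": (71, 848),
    "Hungarian": (18, 305),
}


def student_teacher_ratio(teachers, pupils):
    """Number of pupils per teacher."""
    return pupils / teachers


ratios = {
    group: round(student_teacher_ratio(teachers, pupils), 1)
    for group, (teachers, pupils) in table3_rows.items()
}

# Overall average enrolment: 13,301 pupils across the 76 schools sampled
average_enrolment = round(13301 / 76)

print(ratios)
print(average_enrolment)
```

The results agree with the tabled ratios for these three groups and with the reported overall average of 175 students per school.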

The problem of student attrition in ethnic schools was two-fold: one, the 
percentage of students dropping out during the year; and two, the percentage not 
returning for the following year. For all schools reporting in phase one, 96% 
reported less than a 20% drop-out rate during the year. The drop-out rate was 
highest, however, for the Chinese, where one-quarter of the schools indicated a 
drop-out rate of over 50%. Regarding the year-to-year attrition rate, close to 
three-quarters (70.5%) of the schools reported that over 80% of their students 
returned the following year. Attrition was lowest for the Polish, Jewish, and Greek 
schools and highest for Hungarian and Italian schools. As can be seen in Table 3, 
these two groups had higher student-teacher ratios than all other groups except 
the Greeks. 


There was considerable variation between ethnic groups with reference to 
changing patterns of student enrolment. Concerning trends of student enrolments 
for the ethnic groups taken as a whole, 48.7% of the schools had experienced 
increased enrolments over the past few years, while 15.4% indicated that the 
student enrolment had remained the same. However, when comparing ethnic 
groups with each other, the only ethnic groups that reported no decrease in 
enrolments in any of their schools over the past few years were Greek and 
Chinese. The ethnic group indicating the largest percentage decrease was the 
Hungarian, where 66.7% of the schools indicated a decrease in enrolment over the 
years. Of the German schools, 46.2% experienced a decrease, compared to 45.9% 
of the Ukrainian schools, 40% of Polish schools, and 33.3% of the Jewish schools. 
These enrolment changes might well reflect the changing age structures of the 
population of each ethnic group rather than a lack of commitment to their 
schools. 


Instructional and Curricular Emphases 


Looking specifically at the language of instruction, it was found that 
approximately three-quarters of the schools used English as well as the relevant 
ethnic language as a medium of instruction. This was particularly true for Jewish 
schools where 86.7% of schools used English. In German schools, 84.6% of schools 
reported the use of English. Only 26.7% of the ethnic schools reported that no 
English was used in teaching. No Greek schools in the sample used English in 
instruction, and over two-thirds (67%) of Hungarian schools reported no use of 
English in teaching. Ethnic groups in Edmonton were the most likely of the three 
cities to use English in the schools; 95.2% of the groups in Edmonton used 
English as against 62.8% in Toronto and 72.7% in Vancouver. 

Teaching language skills formed an important objective in all the schools. 
However, different language skills were differentially stressed. For instance, the 
schools placed most emphasis on the acquisition of conversational ability. This 
was followed by reading fluency and writing skills. Learning of grammar was not 
emphasized. 

As to subject areas included in the curriculum, the most emphasized were
literature, history, and general culture (including art, music, and dance). This 
emphasis on selective cultural and linguistic skills was often reinforced through 
the teaching of religion. One interesting omission was civics (political structures 
and processes) which was stressed least for all groups. The pattern of subjects 
emphasized in the curriculum seems to indicate that the central objective of these 
schools, apart from teaching selected skills and general proficiency in language, 
was the socialization of the child into the general culture of the ethnic group and 
the development of strong ethnic identity. The subject most popular with 
students, as reported by the teachers, was history of the homeland of the ethnic 
group. With reference to individual ethnic groups, the Chinese teachers reported 
that students liked music and dance, while the German teachers reported equal 
popularity of geography, music and dance, and literature. History of the ethnic 
homeland was seen as the most popular subject by the Greek, Polish, and 
Ukrainian teachers, and geography by the Italian teachers. 

A pressing problem for nearly all schools was that of finances. Generally, 
students were charged fees, but this source of revenue was inadequate to meet the 
schools’ needs. Nevertheless, educational personnel felt that the responsibility for 
financial support should be shouldered primarily by the parents, secondarily by 
the provincial governments, and lastly by the ethnic community. Interestingly, few 
felt that the federal government should be financially responsible in any way. 

Textbooks were cited as another significant problem for most schools. Few 
books were published or available for purchase in Canada. Consequently, books 
were difficult to obtain. (Most books used in the schools were outdated and/or 
were published in other countries and did not reflect the interests of Canadian 
students.) Most schools could not afford elaborate audiovisual aids and, for all but 
a few, language laboratories were out of the question. For these reasons, 
traditional teaching techniques were used. 


Social Characteristics of Students 

Of the sample as a whole, one-half of the schools reported that none of their 
students were born outside of Canada. However, differences between ethnic groups 
were evident. Italian, Greek, and Chinese schools reported the largest number of
foreign-born students. Schools of the ethnic groups in Edmonton were more likely 
than those of ethnic groups in Toronto or Vancouver to report that none of their 
students were born in the country of origin of their ethnic group (68.2%, 46.5%, 
and 30.8%, respectively). 

The ethnic groups from which over 60% of the students were reported to be 
first generation Canadians, i.e., children of foreign-born parents, were the Greeks, 
Hungarians, Germans, Italians, and Poles. As was expected, most of the Jewish 
schools indicated that the overwhelming majority of their students were second, 
third, or fourth generation Canadian. 

Regarding the social class background of students, about 35% were from 
families reported to be unskilled or semi-skilled, 39% clerical, sales and minor 
professionals, and 25% professionals or higher business executives. German, 
Italian, Greek, and Hungarian students were more likely to be from working class 
backgrounds whereas nearly 65% of Jewish students were reported as coming
from middle class backgrounds. The Ukrainian schools appeared to attract 
students whose parents were from a wide range of occupational backgrounds. 


For students, enrolment in ethnic schools meant carrying an additional 
academic burden. Over two-thirds of the teachers reported that their students 
complained about having to attend supplementary language schools. This was 
highest in Chinese and Polish schools where all the teachers indicated that their 
students frequently expressed unhappiness in this regard. The fewest complaints 
were reported by the Greek and Ukrainian teachers. More specifically, students
expressed concern to their teachers about losing their otherwise free evenings and
Saturdays and about the extra work-load that attending a supplementary language
school imposed.


Social and Attitudinal Characteristics of Staff 


Teaching and administrative personnel of ethnic schools have highly 
interesting characteristics. With reference to citizenship, 75% of the teachers were 
Canadian citizens. For the various ethnic groups specifically, a low of 22.2% of the 
Greek teachers reported Canadian citizenship compared to a high of 96.6% for the 
Ukrainians. All other groups reported 50% or more Canadian citizenship among 
teachers. Edmonton reported the largest proportion of Canadian citizens (84.2%)
compared to 69.8% for Toronto and 71.4% for Vancouver. 


However, most of the teachers, 64.3%, were born outside Canada. All Chinese, 
Italian, and Polish teachers in the sample reported non-Canadian birth. This was 
least true for Jewish teachers (4.2%). Nearly 80% of Jewish teachers reported 
fifth generation Canadian origin. 


Regarding teachers’ educational backgrounds, the average number of years of 
formal education was 14.8. The most highly educated group of teachers were the 
Jews with an average of 16.4 years followed closely by Italians with 16 years and 
Ukrainians with 15.1 years, compared to a low for Chinese of 12.3 years and 
Germans with 12.7 years of formal education. Of the total sample, 35.7% had one 
or more university degrees and 8.1% had less than the completion of high school. 
Of Hungarian teachers, 16.7% had less than the completion of high school 
compared to 17.3% for Germans and 12.5% for Ukrainians. Of all teachers, only 
26.8% had attended teacher training institutions. In summary, although teachers 
were well educated, few had been trained specifically as classroom teachers. 


The educational profile of principals looks very similar to that of teachers, 
with one notable exception. They are slightly more educated than teachers but,
like the teachers, few had received formal teacher training.


Teachers and principals came from a variety of occupations. Only 46.5% of the 
teachers held other full-time or part-time jobs, three-quarters of these being in 
business, administration, or at higher executive levels. Over one-quarter were 
housewives. As for the principals, all were employed full-time elsewhere and over 
three-quarters came from business, administration, or held higher executive 
positions. 


In terms of the perceived goals of their schools, both principals and teachers 
felt that the prime purpose was to provide basic knowledge of the ethnic language. 
The schools’ second most important function was seen to be the maintenance of 
ethnic customs and traditions. While provision of knowledge of the basic language 
was seen as being particularly important by the Greek and Polish groups, both 
Chinese and Jewish teachers and principals were most likely to cite the 
maintenance of customs and traditions as the schools’ most important role. 
However, it is interesting to note that even if the public schools taught in their 
ethnic language, 62.5% of the principals and 58% of the teachers thought that 
ethnic language schools would still be required. This feeling was strongest for 
Hungarian, Jewish, and Ukrainian groups and weakest for Greeks, Italians, and 
Germans. In contrast to their counterparts in Edmonton and Toronto, the 
majority of Vancouver principals (85.7%) felt that their schools would not be 
required if the regular school system implemented a policy of teaching in the 
ethnic language. In Toronto, on the other hand, nearly three-quarters of the 
teachers felt that ethnic schools should still be maintained even if the local school 
boards introduced teaching in their ethnic language. Those who felt that ethnic 
language schools should continue under these conditions stated that ethnic schools 
were necessary to ensure proper language teaching, the maintenance of traditions 
and customs, and to ensure the teaching of religion. 

The existence of ethnic schools represents in varying degrees a desire on the 
part of members of ethnic groups to preserve aspects of their culture and 
communities. Therefore, the involvement in the ethnic community and the 
commitment to ethnicity of ethnic school personnel and the implications that such 
an involvement or commitment might have for the functioning and continuity of 
such schools in Canada must also be examined. 

The data presented here clearly demonstrate that the majority of both 
teachers and principals actively participate in ethnic activities and associational 
life besides the ethnic language schools (approximately 80%). For all groups, the 
principals are more involved in ethnic organizations than are teachers. As to types 
of organizations, the most commonly reported were cultural, educational, and 
religious. Similarly, principals and teachers are heavy consumers of ethnic media. 
They are aware of what exists and support the media through active participation. 
The findings on the use of ethnic media by the respondents suggest that, generally 
speaking, the ethnic school teachers and principals have a strong sense of their 
ethnicity and, perhaps, represent the “activists” among their respective ethnic 
communities. 

Regarding the federal government’s policy of multiculturalism, 92.7% of the 
principals and 64.3% of the teachers had heard of the policy and for both groups, 
an overwhelming majority were in agreement with it. The main reason given for 
supporting the policy was that it would help maintain distinct ethnic identities. 
About 35% of the respondents felt that in addition it would strengthen Canadian 
unity. 


Summary and Discussion 


Ethnic schools have been in existence in Canada for a long period of time. For 
most groups, schools represent a high priority, second only to their churches. 
Religious institutions have played a major role in the formation of schools, the 
provision of space for these operations, the actual financial support, and the 
recruitment of teaching and supervisory personnel. However, there seems to be a 
trend toward the use of non-church facilities, namely public or separate school 
facilities. 


The size of student enrolment varies greatly between ethnic groups. However, 
for most groups the number of students enrolled has remained fairly stable over 
the past few years. This appears to indicate that these schools might have reached 
a plateau in their ability to attract more students from their respective ethnic 
communities. This has happened despite the resurgence of interest in the 
maintenance of ethnic identity and political activism related to it in the past 
decade in Canada. One could argue, however, that the failure of these schools to 
increase their enrolments significantly is due to inadequacies in institutional 
facilities. Yet the findings in this study indicate that facilities, although 
grossly inadequate, were not intolerably poor and, therefore, would not seem to 
have an adverse effect on teacher morale or commitment. Secondly, such 
inadequacies are more likely to influence the effectiveness of teaching in these 
schools rather than their ability to attract students per se. 


Although it cannot be said with absolute certainty, it appears that ethnic 
school attendance is not a predominantly upper or middle class phenomenon. 
However, the appeal remains restricted to only certain segments of the 
populations of each ethnic group. Thus, while it cannot be clearly determined 
whether these schools serve a particular class-based clientele, in the case of their 
teaching and administrative personnel it is evident that a vast majority of them 
come from the middle and upper occupational strata. In addition, it is significant 
that school personnel are also the activists within their own communities. It can 
be tentatively concluded from these findings that the critical support for 
organizing and operating these schools comes from this relatively small segment of 
each ethnic group, the generalized support of church and church-related 
organizations notwithstanding. 


It was found most frequently that students are children of foreign born 
parents, with the exception of Jewish and Ukrainian students who are generally 
second, third, or fourth generation Canadian. Italian, Greek, and Chinese schools 
reported the largest numbers of foreign born parents. In other words, most 
students from these groups were first generation Canadian. Even more important is 
the fact that an overwhelming number of administrative and teaching personnel 
were of foreign birth. The fact that these schools recruited their teachers and 
students so heavily from these two groups (foreign born and first generation) 
would suggest that the ability to maintain their present strength is highly 
dependent on a steady and continuing flow of new immigrants into the country. In 
light of this analysis, then, the Italians with a high rate of immigration can be 
somewhat more optimistic about the continuing adequate supply of teachers and 
students. The only two obvious exceptions to this statement are the Jews and 
Ukrainians. 


Ethnic schools in Canada provide an alternative to the homogenizing influence 
of the public schools. While the latter are used as a vehicle of assimilation and 
integration for children into the mainstream of society, the former serve the 
purpose of socializing the child into an awareness of and commitment to the 
ethnic language, cultural tradition, and history of the specific group. If Canada is 
to maintain her policy of multiculturalism and to use it to encourage and ensure 
cultural pluralism, then it appears that one effective institution to promote the 
realization of this objective may well be the ethnic school and its further 
development should be supported by public funds. 


It is no longer justifiable to view ethnic schools as devoted merely to the 
teaching of ethnic languages or some isolated aspects of ethnicity which is no 
longer “alive.” This study suggests that it would be highly fruitful to view the 
schools as representing potent milieus for the development and inculcation of 
ethnic identity in their young charges, the training of highly self-conscious future 
ethnic leaders, the training and production of ethnic elites at the community and, 
perhaps, national levels, and the drawing together of community members by 
galvanizing them through the highly symbolic activities of the ethnic school. 
Because the ethnic school, as we have portrayed it, brings together the 
particularities of the ethnic life and culture as experienced in the family, 
community, and educational setting, this institution might serve as an excellent 
means of teaching the ethnic language despite its inability to provide the latest 
technology for such instruction. Within the expanded conception of the ethnic 
school offered here, it is suggested that the following aspects of these institutions 
should serve as fruitful avenues for further research: 

(a) Ethnic schools as agencies of ethnic socialization, specifically in the 
development of ethnic identity. 

(b) A comprehensive survey of student populations of these schools focusing on 
who goes to these schools: for example, what percentage of the relevant age group 
of each ethnic group is represented by the students in these schools? In addition, 
study might be made of the students’ perceptions of the school and their 
developing orientations towards ethnicity, the ethnic community, and cultural 
pluralism. 

(c) A thorough-going analysis of the schools’ curricula focusing on the explicit 
as well as implicit or underlying ideological positions and orientations which are 
communicated. This might be essentially an analysis of the hidden curriculum. 

(d) The effectiveness of these schools in teaching ethnic languages. To what 
extent can the cultural climate of the schools discussed in this paper compensate 
for (i) the lack of instructional technology to be found in modern language 
laboratories and (ii) the virtual lack of pedagogical training in the educational 
background of their teaching staffs? 


Notes 
1. Total exceeds 100% due to rounding error. 
2. The definition of ethnicity used here is identical to the census (1970) definition, i.e., country of birth of 
nearest male ancestor before coming to this continent. 
3. It should be noted that the number of schools that responded would have been higher than 78 had it not 
been for the fact that for some ethnic groups, especially the Italians in Toronto and Vancouver, the 
questionnaire was completed by one person in each city on behalf of all the Italian schools operating there. 
4. All of the ethnic schools identified in Edmonton and Vancouver and a sample of schools contacted in Phase 
One in Toronto were included. In each of the selected schools, the director and a sample of two teachers 
comprised the sampling unit for that school. However, it should be noted that the number of selected 
respondents by city and ethnic group is not always a multiple of three in that three respondents were to be 
selected from each school, because of the modified sampling allocation in Toronto and because all schools did 
not have the complement of two teachers to complete the desired sampling unit. 


PERSPECTIVES 


Twenty-five Years of Research in Education: 


To search more diligently and to be driven on by curiosity is commonly the 
lot of one who chooses to become a researcher. Prolonged contemplation of a 
problem and the periods when progress seems lacking or painfully slow are 
frequently prerequisites to that splendid moment of perception when something is 
seen to be true about the world. But the private moment belonging to the 
researcher alone is not sufficient. It must be shared, made public, opened up to 
the tests of others’ perceptions and ideas, too. It is for this reason that the 
publication of the results of research is so vital—publication represents a formal 
invitation to criticism. And for a quarter of a century, The Alberta Journal of 
Educational Research has been one of the few publications in Canada which has 
afforded an opportunity for researchers in education to put forward the results of 
their efforts for careful scrutiny and for practical application. 


It is intriguing to contemplate the interplay of events and ideas shaping the 
nature and direction of research activities in education primarily, though not 
exclusively, in the English-speaking world. The articles selected for publication in 
AJER over its twenty-five year history provide a rather unique opportunity to do 
just that. While its editorial policies have developed and changed somewhat, the 
overall goal of presenting ideas that represent the best in all aspects of 
educational research has not. Accordingly, one can find in its pages examples of 
developmental trends in educational research that transcend international 
boundaries while at the same time demonstrating the ways such trends influence 
research being done in Canada. More specifically, AJER has reflected the research 
activity to be found in a university committed to fostering excellence in the study 
of education (Swift, 1970). At the outset of this retrospective look at the Journal, 
it should be acknowledged that the University of Alberta has demonstrated the 
seriousness of that commitment by its longstanding support of this publication. 
The Journal articles, seen as a whole, provide a mirror image of the ebb and flow 
of interests and emphases, the growth and decline of particular ideas, and the 
effects of change in society at large on the questions and problems addressed by 
researchers in education. 


Educational research in North America generally has changed in style and 
scope within the context of demographic and other social changes surrounding the 
whole of the academic research enterprise. In Canada, we cannot set aside the 
extensive influence of public policies and government resource allocations for 
educational research in the United States which uninhibited communication of 
ideas and the extensive exchange of researchers have induced. One can argue that 
the pursuit of knowledge knows no bounds, but at the same time it would be 
foolish to discount the effects of the larger environment on the quality and 
quantity of research effort. With this presupposition in mind, an examination of 
the articles which have appeared in AJER suggests that there are at least three 
main sources of influence. Besides the national and international events, policies, 
public attitudes—the general milieu in which researchers do research in 
education—there are the ways in which researchers interact with one another and 
with the professional and public monitoring of their activities. More specifically, 
the AJER articles may be viewed as a fairly faithful reflection of the ways in 
which concepts about research in education have developed over a particular 
period: notions about what counts for valid knowledge, the appropriateness of 
various techniques and technologies, the interplay between the world of ideas in 
the social sciences and humanities and in education, the dissemination of research 
findings, and the effects, if any, of their application on practice. 


The ecology of educational research in Canada 

An ecological view of educational research by definition places emphasis on 
development in relation to total environment. Research in education, like research 
in the social sciences generally, grew rapidly following the Second World War. 
While social scientists claimed that their research would provide an understanding 
of humankind in general, which in turn would lead to the formulation of solutions 
to social problems, researchers in education focused on greater understanding of 
people in schools in order to formulate solutions to educational problems. 
Educational researchers in universities shared with their counterparts the 
conviction that analyses of processes and mechanisms would provide the 
knowledge required to resolve problems and to invent better policies and 
practices. Governments viewed education as an economic investment by society 
that would produce better individuals, intellectually and personally, who in turn 
would be better citizens. Funding for research was increased both to help resolve 
social problems through education and to contribute to the quality of preparation 
for productive work and thought in Canadian society. Thus it was that 
educational research activity changed dramatically by the mid-1950’s. It was no 
longer seen to be merely an activity to be carried out by professors of education 
during time not devoted to teaching: instead it was thought to be an instrument 
to be used to cope with the present and to influence the future. 


At the same time, international events took place, heightening awareness of 
philosophical and scientific thought in education elsewhere, particularly in the 
Soviet Union. Reports and studies of differences in theories and practices and 
their resulting effects became sources of ideas for possible improvements in 
Canadian education. Later, when radical changes in American equity laws were so 
painfully being born, the course of educational research began to take on new 
directions. A preoccupation with ways to predict group achievement and to 
standardize treatments of groups began to give ground to a search for ways to 
provide equality of opportunity through education which would offset the effects 
of poverty and discrimination and would take into account the characteristics of 
individuals. 


Thus one finds in the earlier volumes of AJER an article by Sparby (1959) 
addressing the relationship of research in education to the social sciences and the 
value of using an interdisciplinary approach, succeeded by a series of reports of 
research studies which depended heavily on social role theory to examine the 
behaviours of students, teachers, and administrators. The Greenfield and Andrews 
(1961) and Keeler and Andrews (1963) reports are typical of those studies 
undertaken with the specific intent of conducting educational research in order to 
improve the quality of schooling. The focus at the time was on the effectiveness of 
actors in fulfilling designated roles in educational organizations, but five years 
later, one of those authors (Greenfield, 1968) reassessed the use of the concept of 
role behaviour in educational theory and research and suggested new directions 
for future research. Or one can find, beginning with MacDonald (1959), a 
continuing interest in Soviet education: Stewin and others (1969, 1974) 
investigated the Soviet theory of set; Smith (1973) looked at Soviet studies in 
language acquisition; and Gulutsan (1967) was concerned with the relationship 
between the ideas of Piaget and Soviet psychologists. Similarly, a change over 
time appears in articles about tests and testing. In 1956, Clarke and Dunlop wrote 
about “Tests in Measurement and Evaluation” and studies on the use of 
standardized tests of achievement such as Sangster’s (1956) gave way eventually to 
research growing out of the criticism of testing related to such factors as the 
effects of culture (Safran, 1963), anxiety (Frost, 1965), creativity (Cropley, 1967), 
and divergent thinking (Vernon, 1972). Gradually the ideas of Jean Piaget became 
more and more prominent in the studies appearing in AJER: in 1960, Crawford 
reported a study of Piaget’s thought in relation to teaching school mathematics; in 
1967, Coté’s study of Piaget’s formulations on perception and intelligence 
appeared; by 1973, O’Bryan and MacArthur had made an analysis of Piaget’s 
notion of reversibility and Lefrancois had investigated Piaget’s model of 
equilibrium through adaptation. 


It can of course be argued that these themes appear in the pages of AJER 
largely because they reflect the choices of the referees and the editors of the day. 
To some extent that may be the case, but that is too simple an explanation. Final 
selections by editors are made from a pool of articles available, and that pool is 
the result of the research activity of people who have pursued ideas or who may 
have been supported in their pursuit of those ideas for a variety of reasons. It is 
not surprising that some of the research done in education and some of the 
motivations of researchers in education are related to social and political factors 
much larger than the educational enterprise itself. Sources of funding and 
demands for research in education which come from outside of the universities 
illustrate this point. The effects of such interventions are most apparent in the 
work in educational administration associated with the University of Alberta. 


In the preface to Swift’s (1970) volume entitled Educational Administration in 
Canada, Stewart describes the people who were involved in the process of 
establishing a special centre for studies in school administration at the University 
of Alberta. He cites the following as influential factors: an expressed concern 
among members of the Canadian Education Association in the first decade after 
the War that more facilities for post-graduate study in education be made 
available in this country; a disposition among Deans of Education in Canada to 
cooperate to that end and their attitude of good will toward the possibility of 
development at the University of Alberta under the leadership of H. T. Coutts, 
Dean of the Faculty of Education; the support of the Alberta Minister of 
Education, the University Board of Governors, the President of the CEA (in office 
during 1954-55); and Dr. Arthur Reeves, the person “who brought the scheme to 
fruition” by preparing and promoting the proposal for the CEA-Kellogg Project 
(pp. iii, iv). The W. K. Kellogg Foundation offered support for an action program 

    which would have an immediate and beneficial effect on the educational process and, 
    perhaps more important, which would serve to set in motion programs which, having 
    demonstrated usefulness, would be taken over, and be financially supported on a 
    continuing basis by Canadian school authorities and educational institutions. (pp. 
    13-14) 

As Swift suggested, a program of research was undertaken which 
was unlike the kind of research “as that term was generally understood.” Examples 
of some of the studies completed in large part because of the mandate given to 
the Department of Educational Administration at the University of Alberta, 
funded by the Kellogg Foundation initially, and later by the University’s own 
resources, are to be found throughout the pages of AJER. The articles provide 
some evidence that the concerns of practitioners, as voiced by the Canadian 
Education Association, and the concerns of informed members of the larger 
public, as represented by the Kellogg Foundation, had an effect on the course of 
research in education. In this case the interdependence among groups was overtly 
welcomed by Dean Coutts and the Faculty of Education, and the outcomes have 
been documented in considerable detail by Swift. One suspects that in other areas 
of educational research, “outside” influences have not been recognized as clearly 
but are nevertheless no less active in shaping educational research efforts. 


The influence of the scholarly community 

Another of the factors shaping the nature and direction of educational research 
is almost certainly the type of program designed to prepare researchers (Willson, 
1980). The majority of the programs offered in North America twenty-five years 
ago were embedded in the requirements and traditions of university faculties of 
graduate studies. The practice continues to a great extent today. The overall 
result is that the ideas of the scholar-models in universities who supervise theses 
and who are members of thesis committees have a long-term influence on the 
types of problems addressed by researchers in education and on the means of 
exploring them. With the growth of graduate programs in the faculties of 
education in Canada, more professors and an increasing number of graduate 
students carried on this academic tradition and, quite naturally, shared beliefs 
and assumptions about the research enterprise. It also seems reasonable to 
suppose that a centre like the Division of Educational Research at the University 
of Alberta, in the process of facilitating research activity, has been instrumental in 
reinforcing ideas and developing standards of excellence in educational research. 


In some cases the traditions are longstanding and deeply rooted, such as in the 
field of testing and test development, and researchers have consistently addressed 
certain basic issues over time. In 1956, AJER published Powell’s evaluation of the 
“Detroit Beginning First-Grade I.Q. Test” in terms of its validity, reliability, and 
standardization. In 1974, an article by Randhawa, Hunt and Rawlyk reported a 
study which included, among other things, an examination of the reliability and 
validity of “The Canadian Cognitive Abilities Test.” These and other studies 
focusing on the technical aspects of perfecting tools-to-be-used share certain 
characteristics handed down through a long line of researchers. Walker’s article in 
the 1959 volume, assessing the influence of Binet after half a century had elapsed, 
was possible partly because certain presuppositions about the nature of learning 
and testing remained relatively constant among groups of researchers. This is not 
to imply that the field remained static, but that there had been a certain kind of 
continuity. Changes were introduced when some researchers became concerned 
with studying the effects of tests-in-use (Frost, 1965), or when the need for tests 
based on different kinds of assumptions about the nature of learning and thinking 
came to be recognized by the scholarly community (Hobbs, 1975). But, overall, it can be 
said that those scholars in education who see themselves primarily as researchers 
concerned about the application of the methods of social science research to the 
problems of education have actively sought to improve the quality of their 
research and to strengthen programs for training researchers. 


What may be seen to apply to the influence of the whole community of 
scholars in education nationally and internationally may also be seen to affect the 
research activities of groups of scholars who share an interest in a particular 
subject area or, on a small scale, among scholars located in a single institution. 
When fairly stable associations are maintained over the years, expectations for 
research are built up which are productive and influential for students and 
colleagues alike. One sizeable group of articles published by AJER seems to 
illustrate this phenomenon. Beginning in 1956 and continuing regularly 
throughout the pages of AJER, there are reports of studies on school mathematics 
programs, the preparation of teachers of mathematics, and the performance of 
students doing various types of mathematics tasks. The researchers whose names 
are associated with some of these studies seem to indicate the effects of 
longstanding associations among researchers in mathematics education: Nelson 
and Worth, 1960; Nelson, 1964; Nelson and Towler, 1967; Liedtke and Nelson, 
1968; Bana and Nelson, 1977; Bourgeois and Nelson, 1977; Nelson and Kieren, 
1977; and Kieren and Nelson, 1978. 


Another facet of this phenomenon of commonality in research interests and 
the effects of leadership on research directions seems to be apparent in the body 
of work published by AJER related to Soviet scholarship. The heritage of western 
Canada in particular has been enriched by its Eastern European roots and by 
those Canadian scholars who have maintained communications with European 
scholars, specifically with their Soviet counterparts. At the University of Alberta, 
Gulutsan (1967) was a founding member of the interdisciplinary Centre for 
Eastern European Studies and it is noticeable that several of his colleagues in the 
Department of Educational Psychology, coming themselves from a variety of 
research traditions, produced a number of studies analyzing various aspects of 
education in the Soviet Union. MacDonald (1960) wrote “Explanation and the 
Neo-Marxist Theory of Mental Testing” at a time when the race to explore space 
was reaching its competitive heights. A decade later, Hritzuk and Janzen (1970) 
studied the concept of set held by two Soviet psychologists, Luchin and Uznadze, 
followed by Lefrancois’ (1973) account of behaviour theory, and by Cowper and 
Stewin’s (1974) experimental critique of Soviet set theory. Other psychologists, 
like McLeish (1974), examined certain features of Soviet education from a social 
system perspective or, like Stewin and Martin (1974), from a developmental 
perspective. It would be dangerous to ascribe disproportionate effects to the 
influence of scholars on one another, but it does seem reasonable to identify 
personal interests and group contributions among those factors which, for 
whatever sets of complex reasons, have led researchers to follow one course of 
inquiry rather than another. 

This same factor seems to be present in part in the process of change which 
can be seen in the research on teaching and teacher preparation during the past 
twenty-five years. At a time of teacher shortage in North America, Clarke and 
Pilkington (1955) and Aikenhead (1956) investigated the reasons why students 
and practising teachers were not choosing to enter or to remain in the profession. 
Their studies used questionnaires designed to assess the opinions and attitudes 
held by teachers and others. In the next decade the focus of research shifted to 
teacher performance in the classroom and attempted to identify the kinds of 
knowledge and behaviours associated with successful teaching. So, for example, 
Eddy (1963) studied “Teacher Characteristics and Social Studies Achievement” 
and McBride (1964) investigated student assessments of teacher performance 
based on the assumption that when students are generally satisfied with their 
experiences in a classroom they will derive greater benefit from the teaching and 
the learning experiences offered to them. By the mid-1970’s, the notion of the 
effect of teachers’ attitudes towards pupils and teacher-held expectations for pupil 
performance became predominant in the research. Lavoie and Adams (1974) 
conducted a field study to ascertain the ways teacher expectations may be affected 
by children’s physical attributes, sex, and conduct. Hoxter (1974) investigated 
teacher attitudes to find out how these affect students who come from cultures 
other than the teacher’s own. Miller (1974) studied changes in student teacher 




N. Hersom 


attitudes towards children and schooling, reflecting the changes in values held by 
students throughout the period of preservice professional education. Recent 
research has continued the emphasis on the internal rather than the external 
factors related to success in teaching. Where the work of Clarke and Pilkington 
(1955) emphasized the effects of public opinion and of home and school influences 
on prospective teachers, Noll, Willower and Barnette (1977) directed attention to 
the teacher as a person by using measures of “self-actualization” and measures of 
attitudes in terms of “pupil control ideology” to predict the nature of teaching 
behaviours, and Cheong and Wadden (1978) used measures of teacher attitudes to 
predict effects on pupil self-concept. 


If my initial presupposition holds true, that AJER can be seen as a sort of 
weathercock for directions in the developments taking place in educational 
research in North America, then the articles to be found in its twenty-five 
volumes to date reveal something of the state of the art in Canadian research on 
teaching and teacher education. Although one would expect to find many 
educational researchers making the investigation of teaching and teacher 
preparation programs central to their work, especially since so many are 
conducting their studies in the context of professional faculties, this does not seem 
to be the case. The studies appear to be so diverse that it is difficult to find ways 
in which research findings could contribute over time to the building of an 
adequate body of knowledge. Some possible explanations for this state of affairs 
seem to emerge from an overall examination of the reports of research appearing 
in AJER. One observes that there is an almost exclusive association of researchers 
within a discipline or a field of study but little association across disciplines. 
While inquiries by researchers in educational philosophy, sociology, psychology, 
and history or administration and curriculum are ongoing, few of the studies have 
been undertaken in an attempt to work across discipline lines or to synthesize 
findings in order to discover general principles or theories. Work like Lindstedt’s 
(1960) study on “Teacher Qualifications and Grade IX Mathematics” identified 
certain features characteristic of preservice teacher education and subsequent 
teaching experience in schools that can affect the levels of competence attained by 
teachers of mathematics. Greenfield and Andrews (1961) identified certain types 
of teacher behaviours that can be associated with pupil achievement on school 
examinations. There seems to be little evidence of concern on the part of the 
community of educational researchers to take the findings of such studies 
consciously into account when planning subsequent investigations. Even though a 
researcher like Poole (1975) set out to find ways to improve teacher education 
programs by identifying component skills of teaching and by deliberately training 
people to generate ideas on the assumption that this would make for more 
creative teaching in the classroom, the study was conducted on a relatively small 
scale without benefit of cross-consultation with other researchers such as I have 
envisaged. Of course, one might argue that a great deal of the research conducted 
in education implies the need to know more about teaching and teacher 
preparation and thus contributes to our knowledge indirectly. But, historically, the 
resolution of difficult problems in any field of inquiry has not come about by 
circumventing them. Obviously, different researchers will use different approaches 
in searching for new knowledge and may even be in sharp disagreement with one 
another in their views of the best approaches to take, but they share in a common 
pursuit and the influence of the community of scholars makes itself felt. In the 
community of scholars in education there appears to be lacking a continuing, 
persistent commitment to the kind of research that boldly identifies itself with the 
intent to pursue new ideas, to invent new practices, to resolve longstanding 
problems which would lead to the improvement of university and field programs 
in teacher education. 


The traditions of the scholarly community can be tremendously productive but 
there is a danger that those same traditions will become paternalistic and 
conformist in their effects on research. The apprentice researcher who is 
introduced to a particular mode of conceptualizing problems-to-be-solved in 
education may find it difficult to accept or to explore other ways of thinking and 
to use alternative modes and techniques of investigation. The myth of “the 
scientific method” has influenced research in education in North America as it has 
in other fields of inquiry and has played a predominant role in the research in 
education reported in AJER. 


The extension of knowledge in education through research 


To this point the AJER has been viewed partly as a product of its time and 
place and partly as an illustration of the several ways the community of scholars 
in education shapes the work that is ongoing in the field of inquiry. Undergirding 
both perspectives are certain basic notions about what educational research is and 
what counts for valid knowledge in education. Is research in education basic or 
applied and, if it is thought to be applied research, does that mean it is applicable 
to further academic study or to practice, or to both? Baxter (1980) argues that 
research in the applied sciences is “basic in the sense that it is concerned with 
knowledge and methods that might be applicable in a number of different 
situations” and thus it enjoys a kind of strength and versatility which would not 
be possible if it were highly specific (p. 3). Others would draw the line more 
clearly between basic and applied research. Mattersich (1980) distinguishes basic 
from applied research by emphasizing its intents: when research is directed toward 
increasing knowledge it can be termed basic; when it is directed toward practical 
applications of knowledge, means-end relationships, or the resolution of specific 
problems, it is applied research. Using the Mattersich definitions, it can be seen 
that AJER has interpreted its mission to publish research in education broadly 
enough to include studies that lie within each of the two categories of 
intentionality. It can also be seen that the greater number of studies lies in the 
application of knowledge category. 

A selection of 25 of the studies on reading reported from 1955 to the present 
illustrates this weighting. The earlier studies such as the one by Carmichael and 
Rees (1955) surveyed reading achievement levels attained by Alberta pupils in 
order to discover whether or not they were meeting standards attained elsewhere. 
The results of comparisons like these were intended for use by practitioners who, 
presumably, would take them into account when planning curriculum and 
instruction in reading. An experimental study of Koziey and Brauer (1972) 
described methods of teaching reading which demonstrably improved reading 
performance. These and other studies in this particular group of articles were 
clearly intended to assist practitioners with the resolution of problems 
encountered by teachers in the teaching of reading or by pupils in learning how to 
read. By comparison there are relatively few articles reporting research 
deliberately devised to discover new knowledge about reading and language 
phenomena. Stewin and Anderson (1974) tested the power of the concept of 
cognitive complexity in information processing; Evanechko, Armstrong and 
McFetridge (1974) explored the semantic organization underlying verbal 
behaviour; Cummins and Gulutsan (1974) focused on cognitive growth; and Smith 
(1973) searched for understanding in terms of language acquisition concepts. 


Whether AJER articles report findings from applied or from basic research 
studies, one is struck by the consistent emphasis placed on matters of method and 
technique in the majority of them. Perhaps this is not too surprising in view of 
the metamorphosis which has taken place in social science research in the 
post-war years. The earlier volumes of AJER contain reports of studies that are 
predominantly descriptive presentations. These are succeeded by reports of 
studies which used analytical-empirical approaches in the attempt to adapt 
scientific methods to education problem-solving. From the mid-1960’s on, there is 
concern for more rigour, and for more emphasis on the development of theories 
and models (Holdaway & MacIver, 1966), methodology (Maguire, 1969), and new 
techniques (Hunka, 1971). It was during this period, according to Gephart (1980), 
that researchers in education became preoccupied with five major types of 
problems in the conduct of research: problems of analysis, treatment, 
representativeness, measurement, and logic, which even yet have not been 
satisfactorily resolved. One detects in a few of the articles published after 1970 the 
adoption of a different philosophy of science, one which takes into account the 
researcher as person, an integral part of the research enterprise. Nevertheless, the 
research methods and techniques employed most frequently in the articles 
appearing in AJER are those which were developed originally for studies in 
agriculture, biology, or psychology.¹ It is exceptional to find reports of studies 
making use of interview, historical, case study, content analysis, opinion poll, or 
survey sampling. Of fifty-four articles published in AJER from 1955-1979 which 
designate teachers or teaching as a research focus, only six reported the use of 
such techniques as economic prediction, field study, Delphi study, case study, or 
interview procedures. 


Educational research has come to be almost exclusively associated in the minds 
of some people with the use of systematic descriptions and measurement, 
observation, and controlled experimentation, or the creation of instruments to test 
knowledge through verification, confirmation, or falsification. Proportionately 
larger numbers of educational researchers in North America have been committed 
to the use of this particular methodology, resulting, one would expect, in a large 
pool of articles from which to select those for publication. Thus, while the articles 
in AJER represent the prevailing trends in educational research for the past 
twenty-five years, at the same time they provide a silent commentary on the 
comparatively limited number of research studies conducted using other forms of 
inquiry. This makes for an imbalance that skews the image of what counts for 
research in education. Eastwood’s (1975) comment appearing in Volume XXI 
seems particularly pertinent in this regard: 

No one would wish to suggest or be accused of suggesting that a technological 
research pattern is an adequate or appropriate model for educational research.... 
Human learning and human development are considerably more complex processes 
than the development of a new machine. But both are practical activities dependent 
upon the application and utilization of basic research and both necessitate research in 
ways of applying. (p. 74) 


The effects of publishing educational research 


The experience of delving into twenty-five volumes of AJER has forcibly 
reminded me of the diversity of questions confronting researchers in education. 
Some are conceptual, others are methodological, and still others have to do with 
the relationship between research and practice. Clearly, the raising of such 
questions as a stimulus for further thought and research activity is one of the 
primary functions of the Journal and one it has performed well for its readers 
over the years. It is consistent with the statement of purpose set out in the 
masthead: AJER is intended to be 

A quarterly journal devoted to the dissemination, criticism, interpretation and 
encouragement of all forms of systematic enquiry into education and fields related to 
or associated with education. 

My overview has indicated a few of the ways AJER has so admirably fulfilled 
these goals. It has also served to whet my appetite for more information about 
what the field of educational research was like at the time each of the twenty-five 
volumes was in process of preparation for publication. Editors of major research 
journals such as AJER read the assessments made by knowledgeable reviewers of 
hundreds of manuscripts; they make informed decisions about selections; they 
interact with authors representing research in disciplines using different forms of 
inquiry. I think it would be valuable to know more about the issues in educational 
research that consumed the interests of authors and editors of the day. I can only 
speculate a little about the effect that the inclusion of editorial statements might 
have had. If editorials serve to highlight issues important to the mission of a given 
publication, might the editors of AJER have been able to communicate some 
sense of the issues and events and the world of ideas in educational research? 
Might they have sharpened insights or created linkages that could have intensified 
or multiplied the effects of ongoing research? Would such editorials have helped 
us to appreciate their perceptions, so well described by Hodysh (1971) as the 
compound of needs, values, hopes, past experience, and culture used to organize 
and to direct interpretations of reality? 

I applaud the invitation first extended in Volume 24 to provide opportunities 
for scholars to write critically about the general conduct of research in education 
(Hodysh, 1978). Essay reviews, book reviews, and expressions of points of view in 
“Perspectives” pieces are being included in a deliberate effort to present the 
questions being raised in the larger world of research, the debates about premises, 
the shifts in what is taken for granted in educational research. Even so, there are 
problems in publishing matters of current concern: it takes time for the cycle of 
the research process to come round to the publication stage. This publication 
time-lag continues to be a dilemma in disseminating ideas and research results 
because of the benefits that accrue from criticism and revision prior to 
publication. The provision for alternative types of articles should encourage those 
who would challenge our existing ways of thinking and those who can generate 
excitement about the search for knowledge. 
But having said these things, I must conclude by acknowledging the 
far-reaching positive effects of the research articles published by AJER. There is 
very substantial evidence in articles appearing over successive years demonstrating 
that the conduct of research in education has been facilitated by publication of 
work leading to the refinement and improvement of measures and methods. There 
is also evidence along the lines Suppes (1978) has identified for North America 
generally, that knowledge derived from basic research has had an effect on 
practice in education: policy decisions about the content and organization of 
curricula in science and mathematics have been influenced by Piaget’s notions of 
conservation; certain expectations about students and teachers have been modified 
in light of findings in social psychology; and work in the social sciences and 
humanities has changed some concepts about the training of administrators, to 
take just a few examples. AJER came into being at a time when the vision for the 
development of educational research in Canada was fresh and exciting. It is 
impossible to assess precisely the extent to which the Journal helped to translate 
that vision into reality, or to gauge exactly the extent to which the vision itself 
shaped the development of AJER. Traces of both of these factors are to be found 
throughout the twenty-five volumes of the Journal. It is a singularly fine record of 
many of the achievements and of some of the struggles in educational research in 
Canada and abroad. 

And what of the next twenty-five years for AJER? At a time when large 
numbers of publications in every field are in peril of disappearing, my hope is that 
AJER will continue to flourish. From its earliest beginnings to the present time, it 
has played an important role in the whole of the educational research enterprise 
in Canada. It may be that this role can be sharpened somewhat. The time seems 
propitious, with the appointment of the panel of widely representative Consulting 
Editors, for the Journal to assume responsibility for actively identifying gaps in 
knowledge and needed research in education. AJER might deliberately assume 
responsibility for bringing together a critical mass of knowledge from which the 
members of the teaching profession as well as scholars in education could draw 
applications. Or, perhaps, the Journal might consciously assume a special 
responsibility for disseminating the results of research studies that will further, in 
the broadest sense, our understanding of teaching and schooling and the education 
of practitioners. 

The resilience demonstrated by AJER in its first twenty-five years of 
publication will surely stand it in good stead now. It has been characterized by a 
sense of purpose and by those qualities of vitality and perseverance which are 
needed to sustain it through periods of change. These are qualities that will 
enable AJER to continue to make important contributions to the study and 
practice of education in Canada in the years ahead. 


Notes 


1. For a useful comparison, see V. L. Willson, Research Techniques in AERJ Articles 1969-1978, Educational 
Researcher, 1980, 9(6), 5-10. 
RESEARCH NOTES 


A NOTE ON PUPIL EVALUATION IN THE SOVIET UNION 


Success in North American educational programs is frequently described in 
terms of the differences (hopefully significant at the .01 level and in the 
anticipated directions) between pre- and posttest ratings of student ability on 
selected measures. The administration of standardized tests has become, within 
the present century, an intrinsic activity of most classroom teachers, counselors, 
consultants, and other specialists employed by school systems and is the raison 
d’être of various professional groups and organizations concerned with the 
performance of this function. Although the merits of many assumptions and 
practices associated with pupil evaluation are currently being challenged, 
assessments of both academic achievement and mental ability continue as an 
established tradition within most school systems. 


The rationale underlying our use of achievement tests in educational 
institutions is largely self-evident; the rationale relating to the use of psychological 
tests, however, requires some additional comments. Generally, justification for the 
use of these latter measures is based on the premise of providing a comprehensive 
picture of the student, thereby allowing the teacher, counselor/psychologist, or 
school administrator to devise a more individualized curriculum or program geared 
to the needs and levels of functioning of that person. In most instances pupil 
evaluation in our educational system is dualistic in nature, and we attempt to 
gauge student progress (academic achievement) in terms of individual capacity 
(mental ability). 


This approach to pupil evaluation is, however, not universally employed, and 
the current controversies regarding the efficacy and equitability of our procedures 
suggest that a consideration of alternative systems may be appropriate at this 
time. Concepts and practices regarding differences in academic achievement in the 
Soviet Union, for example, are quite distinct from our own (Chauncey, 1959). 
Within the USSR, the only form of student evaluation permitted (except in 
exceptional and individual cases) is achievement testing, and the measures 
employed, even in this area, are substantially less standardized and controlled 
than are the batteries of examinations periodically administered to pupils in 
western schools. The rationale underlying the rejection of psychological tests in 
the USSR has been discussed elsewhere (Stewin, 1978) and will not be dealt with 
here; instead, the purpose of the current paper is to present a summary of the 
evaluative procedures practised in the Soviet school system, followed by some 
comments regarding this approach in light of our current concerns surrounding 
student assessment. 

Although the majority of children in the USSR attend a nursery (for 
six-month- to three-year-olds) or kindergarten (for three-year- to six-year-olds) 
program for at least one year, there seems to be little emphasis on academic 
readiness at this level and no formal evaluation of individual abilities (Jacoby, 
1974). The program seems geared to the development of gross and fine motor 
skills (through play and games, dance, and artistic activities) and to the 
stimulation of group social skills. Students who are judged to exhibit exceptional 
aptitude for art or dance may be encouraged to develop these capacities through 
extra training, and student needs and progress may be discussed with parents on 
occasion, but no standardized evaluations and reports are completed. 


Since the revision of 1964, Soviet schools offer ten years of formal instruction. 
The first three years are considered to be elementary school and the latter years 
are the equivalent of North American junior and senior high school programs. 
Eight years of education are deemed basic for each pupil and are therefore 
compulsory. Eligible students may then elect to continue their education for an 
additional two years and perhaps to apply for admission to a university or other 
specialized (technical, pedagogical, foreign language, medical, fine arts, music, 
military) institute; less successful students generally enter the work force at the 
end of the eight-year program. 

Children in the Soviet Union begin compulsory school attendance in the term 
following their seventh birthday and “are judged according to a rigid grading 
system, beginning with the first week of the first grade” (Jacoby, 1974, p. 75). 
Overall progress is evaluated via oral participation in class, daily homework 
assignments, and periodic classroom examination. Throughout the USSR, grading 
is done on a five-point scale, three being the minimal passing grade. Grades are 
entered daily in each student’s record book and must be reviewed and confirmed 
by parental signatures at the end of each school week (Bereday, 1960, pp. 
178-179). Should a student receive more than one low mark in a week, his parents 
may be requested to discuss the child with school authorities and to supervise 
their child’s homework more stringently. This “daily checking of the pupil’s 
knowledge allows the teacher to form an objective opinion as to his level of 
academic achievement at the end of the school year” (Strezikozin, 1969, p. 152) 
and is thought to eliminate the need for more standardized or comprehensive 
examinations to be administered at the end of term, such as are common in 
western countries. End-of-year examinations are, however, administered in some 
areas of the Soviet Union (Ukraine, Georgia, Kazakhstan) at the discretion of local 
school authorities (op. cit., p. 153). 

Student progress is evaluated in each subject area on the basis of teacher 
judgment, augmented as needed by examinations, and promotion decisions are 
reached. If a pupil is assigned a failing mark in more than two subjects he must 
repeat the grade, but if he fails fewer subjects, he may be allowed to attend 
summer school (at the secondary level) and his status is then reviewed. 
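The promotion rule just described can be sketched in a few lines of code. This is only an illustration: the function name, the labels it returns, and the reading of "fewer subjects" as one or two failures are assumptions introduced here, not part of the Soviet regulations or of the sources cited.

```python
PASSING_MARK = 3  # minimal passing grade on the five-point scale


def promotion_decision(marks):
    """Return a promotion outcome for one pupil.

    marks: dict mapping subject name -> final mark (1 to 5).
    Assumption: one or two failing marks lead to summer school and review;
    more than two failing marks mean the grade is repeated.
    """
    for subject, mark in marks.items():
        if not 1 <= mark <= 5:
            raise ValueError(f"mark for {subject} is outside the five-point scale")

    failed = [subject for subject, mark in marks.items() if mark < PASSING_MARK]

    if len(failed) > 2:
        return "repeat grade"
    if failed:
        return "summer school, then review"
    return "promoted"


print(promotion_decision({"Russian": 4, "mathematics": 2, "history": 3}))
```

The summer-school branch applies, according to the text, at the secondary level; the sketch does not model that restriction.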
Compulsory examinations are administered only at the end of each student’s 
secondary education, following either his eighth (incomplete) or tenth 
(complete) year of schooling. Eight-year program students may be tested in 
Russian language and mathematics and in the native language in non-Russian 
areas; examinations may be in both oral and written form. Students who have 
completed the ten-year (matriculation) program may also be examined in Russian 
literature, a foreign language, Russian and foreign history, physics, chemistry, 
geography, algebra, geometry, and/or social sciences. “The examination in Russian 
and mathematics is written. All other examinations are oral” (Kreusler, 1976, p. 
46). 

Written tests consist largely of essay and short-answer questions; however, 
Ryumshin (1976) has noted a substantial increase in the use of the multiple-choice 
format on recent biology examinations. The written examinations are centrally 
prepared and are made available to the schools only one or two days before they 
are to be administered. Tests are supervised and results are evaluated by local 
school authorities according to guidelines provided by a central committee. 


The procedure concerning administration of the oral examinations is quite 
different. Several weeks before the examination date, pamphlets are circulated to 
students announcing the subjects on which oral questions will be based. Subject 
area teachers then formulate specific questions and place them on cards known as 
Bilety; there are generally three questions per card. (Sample questions may be 
found in Apanasewicz and Rosen, 1966, and in Kreusler, 1976, pp. 47-49.) Oral 
examinations are conducted by a panel consisting of the subject teacher, the 
subject teacher from a second school, and the school director (administrator). 
Pupils are called before the panel one at a time. Each student draws a card and is 
allowed approximately 15 minutes to formulate his responses to the questions 
printed on the card; he is then given 10 minutes to reply. His responses are 
evaluated by the panel and a grade is assigned. (For a more detailed account of 
oral examination procedures and a discussion of their strengths and weaknesses, 
the reader is directed to Gloriozov, 1974.) 


Students who successfully complete (pass) all matriculation examinations are 
awarded a Certificate of Maturity, and students who perform exceptionally well 
are awarded medals. Pupils who receive marks of five in all tenth year subjects 
are presented with gold medals, and those who receive marks of five in 
matriculation subjects and four in other areas are presented with silver medals. 
Gold and silver medalists were, at one time, given preferential treatment with 
respect to admission to postsecondary institutions (Shumilin, 1962); however, 
Grant (1964) and Skurnik (1968) note that all students are now required to write 
entrance examinations when applying for further education. 
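The award rules in this paragraph may be easier to follow when written out as a small decision procedure. The sketch below is hypothetical: the function name and return format are invented for illustration, and "four in other areas" is read as "at least four", since a pupil with fives throughout would already qualify for gold.

```python
PASSING_MARK = 3  # minimal passing grade on the five-point scale


def matriculation_award(matriculation_marks, other_marks):
    """Return (certificate_awarded, medal) for a tenth-year pupil.

    matriculation_marks: dict of subject -> mark in examined subjects.
    other_marks: dict of subject -> mark in the remaining subjects.
    medal is "gold", "silver", or None.
    """
    matric = list(matriculation_marks.values())
    others = list(other_marks.values())

    # The Certificate of Maturity requires a pass in every matriculation subject.
    if any(mark < PASSING_MARK for mark in matric):
        return (False, None)

    # Gold: marks of five in all tenth-year subjects.
    if all(mark == 5 for mark in matric + others):
        return (True, "gold")

    # Silver: five in matriculation subjects, four (assumed: or better) elsewhere.
    if all(mark == 5 for mark in matric) and all(mark >= 4 for mark in others):
        return (True, "silver")

    return (True, None)


print(matriculation_award({"Russian": 5, "mathematics": 5}, {"physics": 4}))
```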


In summary, students in the Soviet school system are evaluated on a daily 
basis in a variety of subject areas and are assigned ratings ranging from one to 
five. Grades are allocated by the teacher after considering the pupil’s responses in 
the classroom, the adequacy of his homework assignments, and the knowledge 
displayed on periodic subject matter tests. This information is deemed sufficient 
for making promotion decisions in most instances, and standardized academic 
examinations are administered only to students completing their secondary 
education. Final examinations in specific subjects may be written (supplied by a 
central education committee), oral (formulated by local subject matter teachers 
with guidance by the central committee), or both. Students who perform 
exceptionally well on these tests are recognized by the state through the 
presentation of medals. 


In contrast to present practices in many countries, Soviet pupils are evaluated 
solely on the basis of academic achievement relative to the curriculum for their 
grade level and are not judged relative to their peers in terms of their own 
capacities. Differences in achievement are, in the absence of obvious physical or 
physical-mental handicaps, generally ascribed to individual interests (and 
consequent motivational levels) or to the quality of instruction provided rather 
than to differing aptitudes for learning. Little, if any, direct attention is given to 
other factors which may inhibit the learning process and little individualization of 
instructional approach, in consideration of the abilities or learning style of the 
student, is advocated for the regular classroom teacher. 


Adherence to a unidimensional and rigid system of grading and the lack of 
standardization (or confirmation) of teacher evaluations regarding pupil 
competencies has created some difficulties for teachers and administrators within 
the Soviet school system; there has been some agitation in the press for a review 
of policy concerning testing (Schlesinger, 1964, 1967, 1972). As early as 1961 
concern was being expressed that there should be a more objective determination 
of students’ abilities and that these capacities should be identified early so that a 
better selection of students would be made by various institutions of higher 
education (De Witt, 1961). Soviet writers seldom specify the nature of the 
assessment procedures they are advocating, but a review of the literature (Stewin, 
1978) suggests that reintroduction of some forms of both aptitude and 
standardized achievement measures seems plausible. 

Given the current Soviet perspective, it seems unlikely that a major change in 
assessment practices will occur; however, the demands of technology may result in 
the introduction of some measures designed to identify the scientifically gifted 
individual. Also, governmental policy in regards to the needs of particular 
students for specialized programming or instruction on a compensatory basis has 
shown a shift in recent years, and some educational programs allowing the system 
to accommodate the exceptional learner have been introduced (Gallagher, 1974). 
This trend may result in the development and introduction of more 
comprehensive and, perhaps, more standardized assessment procedures than are 
currently employed in the USSR. 

It is hoped that this brief survey of the evaluative procedures employed in 
Soviet schools and of the concerns expressed by educators in that setting has 
offered some insights to the reader familiar with western assessment procedures. 
The present controversy in North America regarding the equitability of our 
established testing programs and criticisms of our over-reliance on assessment 
data have caused some observers to advocate a severe restriction or even 
discontinuance of standardized testing within school systems. This suggestion 
would seem, in consideration of the Soviet experience, somewhat extreme. 
Perhaps the conclusion to be reached upon reviewing the assessment practices 
in both situations would be to substantiate that standardized measures should be 
employed judiciously to augment, rather than to replace, sound teacher judgment. 
Reliance on any one source of information regarding pupil progress, whether 
obtained via objective measurement or by classroom observation, is insufficient to 
permit educational staff to monitor adequately existing programs and to provide 
learning experiences appropriate to the needs of individual students. Only a 
careful compilation of both quantitative and qualitative knowledge seems 
sufficient to satisfy these rigorous requirements, foremost in the minds of 
concerned educators of all political persuasions. 


L. L. Stewin 
Department of Educational Psychology 
The University of Alberta 
BOOK REVIEWS 


CURRICULUM INTEGRATION AND LIFELONG EDUCATION. By James B. Ingram. 
Volume 6 in Advances in Lifelong Education Series. Hamburg: 
Pergamon Press and the UNESCO Institute for Education, 1979. 


Volume 6 in the series Advances in Lifelong Education is a credit to the 
painstaking concern for detail and explanation of the author, James B. Ingram. 


The thoroughly rational and sequential ordering of the text is reinforced by its 
strong academic content on the nuances of curriculum integration. Ingram begins 
with the qualification that the text is primarily concerned with the concept of 
curriculum integration as a necessary precursor to, and component of, integrated 
and coordinated lifelong education. 


The first and final chapters specifically relate curriculum integration to 
lifelong education, while the five intervening chapters deal with particular aspects 
of the concept of integration: chapter 2, “The Nature and Variety of Curriculum 
Integration”; chapter 3, “Functions of Integration”; chapter 4, “Teaching Styles 
and Integrated Approaches”; chapter 5, “Integration and School Organization”; 
and chapter 6, “Problems and Prospects.” The first chapter, “Lifelong Education 
and Curriculum Integration,” appropriately sets the scene for the subsequent 
analysis, in discussing the perceived relationship between the two concepts, 
followed by separate explanations of what lifelong education is perceived to be, 
and the perceived significance of curriculum integration. The final chapter, 
“Towards Lifelong Education,” discusses the similarity between the goals of the 
two concepts, curriculum integration as a practical outlet for the principles of 
lifelong education, and the transfer of lifelong education principles from, or 
through, the curriculum to the community. 

If any weakness is to be noted in the discussion of curriculum integration with 
lifelong education, it is that the bulk of the text deals with the former, and, as 
indicated, the actual attempted integration of the two concepts addressed is 
restricted to the first and last chapters. Some additional cross-reference in the 
central chapters would, I believe, have provided a more fully integrated approach 
to the topic. Given this perception, however, Ingram has had some success in 
dealing with the somewhat philosophical and abstract nature of lifelong education 
and the more practical and everyday issue of curriculum integration to produce an 
eminently useful commentary on these somewhat unrelated concepts. 


The significant emphasis in the approach to the topic is towards the 
individual rather than the system. Ingram calls for rationality and order to what 
he believes has, to date, been a piecemeal approach. His approach in the central 
chapters is to take curriculum integration beyond the confines of the traditional 
classroom educational setting to the more broadly based consideration of 
education through life, in terms of the individual, and while some emphasis is 
devoted to organizational considerations, the underlying concern is for the 
individual’s participation and potential gains rather than for organizational 
expediency. 

In chapter 6, some consideration is given to the problems associated with, and 
prospects for, achieving the desired integration. These are viewed from the local as 
well as the political level. Factors affecting the potential influence of curriculum 
integration on lifelong education are presented in the final chapter. It is at this 
point that the major threads of the argument are drawn together. 


Ingram states: “the ultimate purpose of integration relates not to the work of 
schools but to the lives of individuals. If its influence was contained within the 
walls of schools it would be of little moment.” This comment epitomizes the 
stance adopted by Ingram which includes the notion that curriculum integration 
makes a significant contribution not only to educational progress, but also to 
personality and community development. 

The text is both an informative and eminently readable treatise on the topic, 
and is strongly recommended for both specialist and tertiary reading in the 
area. 


Ross H. Millikan 
Coordinator of Educational Administration 
University of Melbourne 


HELPING THE RETARDED CHILD IN THE ELEMENTARY 
SCHOOL YEARS. By J. B. Fotheringham and J. Morris. Toronto: Guidance 
Center, University of Toronto, 1979, 148 pp. (ISBN 0-7713-0079-4) 


In an era of increased social awareness of the needs of the mentally 
handicapped, and a revitalized educational thrust in the direction of integration 
and normalization, both public and professional education concerning mental 
retardation has taken on new importance. Fotheringham and Morris have 
provided a worthwhile vehicle to help facilitate this process. The book is not an 
expansive review of recent psychological theory nor is it an introduction to new 
conceptual frameworks. In line with its purpose, the language is neither jargonistic 
nor stilted; it is an attempt to educate rather than extend research frontiers. In 
the preface, the underlying assumptions of the text are defined. The assumptions 
incorporate numerous widely accepted views of the retarded child and, in 
themselves, are a starting point to frame a perspective of mental retardation. 

It is interesting to note that in one assumption the retarded child is described 
as having “a disorder in his brain’s problem solving skills.” Later, in the same 
chapter, the writers comment that “Generally speaking, the main defect in the 
retarded person’s brain is in the central interpretive mechanisms” (p. 3). Although 
the writers later consider genetic and environmental factors, references early in 
the text to the retarded individual’s “poorly developed brain” might lead the 
reader to assume a defect model of retardation having as its salient feature 
structural disorder. As is clarified later in the text, genetic and social factors 
(independent of physiological defect) account for numerous instances of educable 
mental retardation. 


In describing the causes of mental retardation, the writers relate that 80 
percent of retardation is associated with poverty. Since malnutrition is commonly 
associated with poverty, it might have been worthwhile to accentuate the link 
given the large body of current research concerning protein-caloric undernutrition 
and mental retardation. As the content is directed toward parents as well as 
professionals in areas related to retardation, a description of the results of 
exemplary programs such as that being carried out at the Montreal Diet 
Dispensary might have been a timely inclusion. 


Chapter II on Developmental Stages presents, in essence, a review of Piagetian 
theory as it relates to the mentally retarded child. The content is not novel but is 
well presented and facilitates a basic understanding of Piaget’s description of 
ontogenetic development. Similarly, Chapters III and IV provide an introduction 
to two-phase assessment—formal and informal. Sound pragmatic advice is offered 
to the elementary teacher who attempts to differentiate between situational and 
chronic-organic influences on the child’s level of functioning. A basic information 
processing model serves to point out the intricacy of cognition and the numerous 
potential problematic areas for the retarded elementary school learner. 


In discussing the stresses associated with an elementary school retarded child, 
the vital link between home and school is emphasized. Although the writers 
describe our times as the “era of worship of education and veneration of the 
professional person,” it might be said with some justification that such a view of 
the professional is declining rather than gaining momentum as parents become 
more highly educated and society in general becomes increasingly informed 
concerning mental retardation. Increased accountability on the part of teachers 
and a more open questioning of school programs will make it even more important 
to perceive the home and school as “necessary partners.” 

One of the most informative aspects of the book is its consideration of school 
arrangements for the mentally retarded child. Rather than advocating an 
immediate wholesale integration of all retarded children into the regular 
classroom, the writers have carefully considered factors such as degree of 
handicap, chronological age, and social maturity. Perceived limitations upon the 
academic integration of moderate or TMR children seem realistic and warranted, 
aligning with the Piagetian developmental model presented earlier. Functional 
requisites to integration related to teacher awareness and school support are 
clearly defined in the discussion of feasibility. As noted, the question is not one of 
whether segregation is unfair and integration fair but rather of “what is best for 
any particular child” (p. 66). 

Helping the Retarded Child in the Elementary School Years provides a 
meaningful introduction to basic concepts in mental retardation. It helps to dispel 
prevalent myths concerning the retarded child and prompts a questioning attitude 
toward self-acclaimed research such as that of Doman and Delacato. 

The general tenor of the book is positive, looking toward a future in which the 
retarded individual will become increasingly independent, a participating 
member of society. Certainly social workers, teachers anticipating having retarded 
children in their classrooms, school principals, parents, and others with a keen 
interest in the welfare of retarded children would benefit from reading this book. 


Wayne C. Nesbit 
Department of Educational Psychology 
Memorial University of Newfoundland 